Title:
Mapping the space of protein binding sites with sequence-based protein language models Open Access.
Source:
Bioinformatics; Jun 2025, Vol. 41, Issue 6, pp. 1-8 (8 pages)
Database:
Complementary Index

Abstract:

Motivation: Binding sites are the key interfaces that determine a protein's biological activity, and therefore common targets for therapeutic intervention. Techniques that help us detect, compare, and contextualize binding sites are hence of immense interest to drug discovery.

Results: Here, we present an approach that integrates protein language models with a 3D tessellation technique to derive rich and versatile representations of binding sites that combine functional, structural, and evolutionary information with unprecedented detail. We demonstrate that the associated similarity metrics induce meaningful pocket clusterings by balancing local structure against global sequence effects. The resulting embeddings are shown to simplify a variety of downstream tasks: they help organize the 'pocketome' in a way that efficiently contextualizes new binding sites, construct performant druggability models, and define challenging train-test splits for believable benchmarking of pocket-centric machine-learning models.

Availability and implementation: A Python package that implements the EPoCS method is freely available at https://github.com/tugceoruc/epocs.

[ABSTRACT FROM AUTHOR]
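The abstract mentions using pocket similarity to define challenging train-test splits. The sketch below illustrates that general idea only; it is not the EPoCS implementation and does not use the EPoCS package API. It assumes each binding site has already been reduced to a fixed-length embedding vector, uses cosine similarity as a stand-in for the paper's similarity metrics, and groups pockets by single-linkage clustering. The function names, the threshold of 0.9, and the test fraction are hypothetical choices made for illustration.

```python
import math
import random


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def similarity_clusters(embeddings, threshold):
    """Single-linkage clusters: two pockets share a cluster if they are
    linked by a chain of pairs with similarity >= threshold."""
    parent = list(range(len(embeddings)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(embeddings)):
        for j in range(i + 1, len(embeddings)):
            if cosine_similarity(embeddings[i], embeddings[j]) >= threshold:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(embeddings)):
        clusters.setdefault(find(i), []).append(i)
    return list(clusters.values())


def cluster_split(embeddings, threshold=0.9, test_fraction=0.2, seed=0):
    """Assign whole clusters to train or test, so that no test pocket has
    a train pocket with similarity >= threshold (hypothetical defaults)."""
    clusters = similarity_clusters(embeddings, threshold)
    random.Random(seed).shuffle(clusters)
    target = test_fraction * len(embeddings)
    train, test = [], []
    for members in clusters:
        # Fill the test set first; clusters are never divided.
        if len(test) < target:
            test.extend(members)
        else:
            train.extend(members)
    return sorted(train), sorted(test)
```

Because clusters are never divided, the test set can overshoot the requested fraction, and a data set that forms a single cluster ends up entirely in the test set. A lower threshold merges more pockets per cluster and therefore yields a harder, less leaky split.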

Copyright of Bioinformatics is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)