Seq-InSite: sequence supersedes structure for protein interaction site prediction.
Autor: | Hosseini S; Department of Computer Science, University of Western Ontario, London, ON N6A 5B7, Canada., Golding GB; Department of Biology, McMaster University, Hamilton, ON L8S 4K1, Canada., Ilie L; Department of Computer Science, University of Western Ontario, London, ON N6A 5B7, Canada. |
---|---|
Jazyk: | angličtina |
Zdroj: | Bioinformatics (Oxford, England) [Bioinformatics] 2024 Jan 02; Vol. 40 (1). |
DOI: | 10.1093/bioinformatics/btad738 |
Abstrakt: | Motivation: Proteins accomplish cellular functions by interacting with each other, which makes the prediction of interaction sites a fundamental problem. As experimental methods are expensive and time consuming, computational prediction of the interaction sites has been studied extensively. Structure-based programs are the most accurate, while the sequence-based ones are much more widely applicable, as the sequences available outnumber the structures by two orders of magnitude. Ideally, we would like a tool that has the quality of the former and the applicability of the latter. Results: We provide here the first solution that achieves these two goals. Our new sequence-based program, Seq-InSite, greatly surpasses the performance of sequence-based models, matching the quality of state-of-the-art structure-based predictors, thus effectively superseding the need for models requiring structure. The predictive power of Seq-InSite is illustrated using an analysis of evolutionary conservation for four protein sequences. Availability and Implementation: Seq-InSite is freely available as a web server at http://seq-insite.csd.uwo.ca/ and as free source code, including trained models and all datasets used for training and testing, at https://github.com/lucian-ilie/Seq-InSite. (© The Author(s) 2024. Published by Oxford University Press.) |
Databáze: | MEDLINE |
Externí odkaz: |