HEMEsPred: Structure-Based Ligand-Specific Heme Binding Residues Prediction by Using Fast-Adaptive Ensemble Learning Scheme
Autor: | Guifu Yang, Jian Zhang, Bo Gao, Haiting Chai, Zhiqiang Ma |
---|---|
Rok vydání: | 2018 |
Předmět: |
Models
Molecular 0301 basic medicine Heme binding Computer science Heme Machine learning computer.software_genre Protein Structure Secondary Machine Learning 03 medical and health sciences chemistry.chemical_compound Protein structure Robustness (computer science) Genetics Feature (machine learning) Databases Protein Protein secondary structure Binding Sites business.industry Applied Mathematics Computational Biology Ensemble learning 030104 developmental biology chemistry Benchmark (computing) Artificial intelligence Biological system business computer Algorithms Software Protein Binding Biotechnology |
Zdroj: | IEEE/ACM Transactions on Computational Biology and Bioinformatics. 15:147-156 |
ISSN: | 2374-0043 1545-5963 |
DOI: | 10.1109/tcbb.2016.2615010 |
Popis: | Heme is an essential biomolecule that widely exists in numerous extant organisms. Accurately identifying heme binding residues (HEMEs) is of great importance in disease progression and drug development. In this study, a novel predictor named HEMEsPred was proposed for predicting HEMEs. First, several sequence- and structure-based features, including amino acid composition, motifs, surface preferences, and secondary structure, were collected to construct feature matrices. Second, a novel fast-adaptive ensemble learning scheme was designed to overcome the serious class-imbalance problem as well as to enhance the prediction performance. Third, we further developed ligand-specific models considering that different heme ligands varied significantly in their roles, sizes, and distributions. Statistical test proved the effectiveness of ligand-specific models. Experimental results on benchmark datasets demonstrated good robustness of our proposed method. Furthermore, our method also showed good generalization capability and outperformed many state-of-art predictors on two independent testing datasets. HEMEsPred web server was available at http://www.inforstation.com/HEMEsPred/ for free academic use. |
Databáze: | OpenAIRE |
Externí odkaz: |