Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
Autor: | Bicui Ye, Qingyao Wu, Huaqing Min, Yonghui Xu, Hengjie Song |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2017 |
Předmět: |
0301 basic medicine
Scheme (programming language) Distribution (number theory) Computer science Genome-wide association study 02 engineering and technology Machine learning computer.software_genre Genome Article Domain (software engineering) Machine Learning 03 medical and health sciences 0202 electrical engineering electronic engineering information engineering Protein function prediction computer.programming_language Multidisciplinary Training set business.industry Proteins Construct (python library) Genomics 030104 developmental biology Metric (mathematics) 020201 artificial intelligence & image processing Artificial intelligence Transfer of learning business computer Algorithms Test data Genome-Wide Association Study |
Zdroj: | Scientific Reports |
ISSN: | 2045-2322 |
Popis: | Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods. |
Databáze: | OpenAIRE |
Externí odkaz: |