Disease Risk of Missense Mutations Using Structural Inference from Predicted Function
Autor: | Kai Wang, Ram Samudrala, Michael L. Cunningham, Orapin V. Horst, Jeremy A. Horst |
---|---|
Rok vydání: | 2010 |
Předmět: |
Models
Molecular Sequence analysis Mutation Missense Biology medicine.disease_cause Polymorphism Single Nucleotide Risk Assessment Biochemistry Article Substitution matrix Protein structure Risk Factors Databases Genetic Lac Repressors OMIM : Online Mendelian Inheritance in Man medicine Humans Missense mutation Computer Simulation Disease Databases Protein Molecular Biology Sequence (medicine) Genetics Mutation Protein Stability Cell Biology General Medicine Markov Chains Protein Structure Tertiary Amino Acid Substitution Mutant Proteins Neural Networks Computer Sequence Alignment Algorithms Function (biology) |
Zdroj: | Current Protein & Peptide Science. 11:573-588 |
ISSN: | 1389-2037 |
DOI: | 10.2174/138920310794109139 |
Popis: | Advancements in sequencing techniques place personalized genomic medicine upon the horizon, bringing along the responsibility of clinicians to understand the likelihood for a mutation to cause disease, and of scientists to separate etiology from nonpathologic variability. Pathogenicity is discernable from patterns of interactions between a missense mutation, the surrounding protein structure, and intermolecular interactions. Physicochemical stability calculations are not accessible without structures, as is the case for the vast majority of human proteins, so diagnostic accuracy remains in infancy. To model the effects of missense mutations on functional stability without structure, we combine novel protein sequence analysis algorithms to discern spatial distributions of sequence, evolutionary, and physicochemical conservation, through a new approach to optimize component selection. Novel components include a combinatory substitution matrix and two heuristic algorithms that detect positions which confer structural support to interaction interfaces. The method reaches 0.91 AUC in ten-fold cross-validation to predict alteration of function for 6,392 in vitro mutations. For clinical utility we trained the method on 7,022 disease associated missense mutations within the Online Mendelian inheritance in man amongst a larger randomized set. In a blinded prospective test to delineate mutations unique to 186 patients with craniosynostosis from those in the 95 highly variant Coriell controls and 1000 age matched controls, we achieved roughly 1/3 sensitivity and perfect specificity. The component algorithms retained during machine learning constitute novel protein sequence analysis techniques to describe environments supporting neutrality or pathology of mutations. This approach to pathogenetics enables new insight into the mechanistic relationship of missense mutations to disease phenotypes in our patients. |
Databáze: | OpenAIRE |
Externí odkaz: |