Organization of gene programs revealed by unsupervised analysis of diverse gene-trait associations.

Autor: Mizikovsky D; Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia., Naval Sanchez M; Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia., Nefzger CM; Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia., Cuellar Partida G; University of Queensland, Diamantina Institute, Brisbane, QLD, Australia., Palpant NJ; Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia.
Jazyk: angličtina
Zdroj: Nucleic acids research [Nucleic Acids Res] 2022 Aug 26; Vol. 50 (15), pp. e87.
DOI: 10.1093/nar/gkac413
Abstrakt: Genome wide association studies provide statistical measures of gene-trait associations that reveal how genetic variation influences phenotypes. This study develops an unsupervised dimensionality reduction method called UnTANGLeD (Unsupervised Trait Analysis of Networks from Gene Level Data) which organizes 16,849 genes into discrete gene programs by measuring the statistical association between genetic variants and 1,393 diverse complex traits. UnTANGLeD reveals 173 gene clusters enriched for protein-protein interactions and highly distinct biological processes governing development, signalling, disease, and homeostasis. We identify diverse gene networks with robust interactions but not associated with known biological processes. Analysis of independent disease traits shows that UnTANGLeD gene clusters are conserved across all complex traits, providing a simple and powerful framework to predict novel gene candidates and programs influencing orthogonal disease phenotypes. Collectively, this study demonstrates that gene programs co-ordinately orchestrating cell functions can be identified without reliance on prior knowledge, providing a method for use in functional annotation, hypothesis generation, machine learning and prediction algorithms, and the interpretation of diverse genomic data.
(© The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.)
Databáze: MEDLINE