Cell type–specific interpretation of noncoding variants using deep learning–based methods

Autor: Maria Sindeeva, Nikolay Chekanov, Manvel Avetisian, Tatiana I Shashkova, Nikita Baranov, Elian Malkin, Alexander Lapin, Olga Kardymon, Veniamin Fishman
Rok vydání: 2023
Předmět:
Zdroj: GigaScience. 12
ISSN: 2047-217X
DOI: 10.1093/gigascience/giad015
Popis: Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding mutations. However, these approaches require specific experimental data for training and cannot generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type–specific properties, build biologically meaningful vector representations of cell types, and utilize these representations to generate cell type–specific predictions of the effects of noncoding variations in the human genome.
Databáze: OpenAIRE