Automated prediction of the clinical impact of structural copy number variations

Autor: Michaela Gaziova, Tomas Sladecek, Ondrej Pos, Martin Stevko, Werner Krampl, Zuzana Pos, Rastislav Hekel, Mario Hlavacka, Marcel Kucharik, Jan Radvanszky, Jaroslav Budis, Tomas Szemes
Předmět:
Zdroj: UnpayWall
ORCID
Microsoft Academic Graph
DOAJ-Articles
PubMed Central
Europe PubMed Central
Scientific Reports
Scientific Reports, Vol 12, Iss 1, Pp 1-14 (2022)
ISSN: 2045-2322
DOI: 10.1038/s41598-021-04505-z
Popis: Introduction: Copy number variants (CNVs) play an important role in many biological processes, including the development of genetic diseases, making them attractive targets for genetic analyses. The interpretation of the effect of structural variants is a challenging problem due to highly variable numbers of gene, regulatory or other genomic elements affected by the CNV. This led to the demand for the interpretation tools that would relieve researchers, laboratory diagnosticians, genetic counselors, and clinical geneticists from the laborious process of annotation and classification of CNVs. Materials and Methods: We designed a classifier method based on the annotations of CNVs from several publicly available databases. The attributes take into account gene elements, regulatory elements affected by the CNV, as well as other CNVs with known clinical significance that overlap the candidate CNV. We also describe the process of model selection and the construction of training, validation, and test set. Results: The presented approach achieved more than 98% prediction accuracy on both copy number loss and copy number gain variants and can be improved by imposing probability thresholds to eliminate low confidence predictions. Discussion: Method has shown considerable performance in predicting the clinical impact of CNVs and therefore has a great potential to guide users to more precise conclusions. The CNV annotation and pathogenicity prediction can be fully automated, relieving users of tedious interpretation processes. Availability and Implementation: The results can be reproduced by following instructions at {{https://github.com/tsladecek/isv}}.
Databáze: OpenAIRE
Nepřihlášeným uživatelům se plný text nezobrazuje