Statistical methods with exhaustive search in the identification of gene-gene interactions for colorectal cancer

Autor: Somayeh Kafaie, Ting Hu, Ling Xu
Rok vydání: 2020
Předmět:
Zdroj: Genetic epidemiologyREFERENCES. 45(2)
ISSN: 1098-2272
Popis: Though additive forms of heritability are primarily studied in genetics, nonlinear, non-additive gene-gene interactions, that is, epistasis, could explain a portion of the missing heritability in complex human diseases including cancer. In recent years, powerful computational methods have been introduced to understand multivariable genetic factors of these complex human diseases in extremely high-dimensional genome-wide data. In this study, we investigated the performance of three powerful methods, BOolean Operation-based Screening and Testing (BOOST), FastEpistasis, and Tree-based Epistasis Association Mapping (TEAM) to identify interacting genetic risk factors of colorectal cancer (CRC) for genome-wide association studies (GWAS). After quality-control based data preprocessing, we applied these three algorithms to a CRC GWAS data set, and selected the top-ranked 100 single-nucleotide polymorphism (SNP) pairs identified by each method (251 SNPs in total), among which 74 pairs were common between FastEpistasis and BOOST. The identified SNPs by BOOST, FastEpistasis, and TEAM mapped to 58, 57, and 62 genes, respectively. Some genes highlighted by our study, including MACF1, USP49, SMAD2, SMAD3, TGFBR1, and RHOA, have been detected in previous CRC-related research. We also identified some new genes with potential biological relevance to CRC such as CCDC32. Furthermore, we constructed the network of these top SNP pairs for three methods, and the patterns identified in the networks show that some SNPs including rs2412531, rs349699, and rs17142011 play a crucial role in the classification of disease status in our study.
Databáze: OpenAIRE