Linking genotype to phenotype in multi-omics data of small sample

Autor:	Xinpeng Guo, Yafei Song, Shuhui Liu, Meihong Gao, Yang Qi, Xuequn Shang
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Multi-omics Small sample SNP Gene Phenotype Biotechnology TP248.13-248.65 Genetics QH426-470
Zdroj:	BMC Genomics, Vol 22, Iss 1, Pp 1-11 (2021)
Druh dokumentu:	article
ISSN:	1471-2164
DOI:	10.1186/s12864-021-07867-w
Popis:	Abstract Background Genome-wide association studies (GWAS) that link genotype to phenotype represent an effective means to associate an individual genetic background with a disease or trait. However, single-omics data only provide limited information on biological mechanisms, and it is necessary to improve the accuracy for predicting the biological association between genotype and phenotype by integrating multi-omics data. Typically, gene expression data are integrated to analyze the effect of single nucleotide polymorphisms (SNPs) on phenotype. Such multi-omics data integration mainly follows two approaches: multi-staged analysis and meta-dimensional analysis, which respectively ignore intra-omics and inter-omics associations. Moreover, both approaches require omics data from a single sample set, and the large feature set of SNPs necessitates a large sample size for model establishment, but it is difficult to obtain multi-omics data from a single, large sample set. Results To address this problem, we propose a method of genotype-phenotype association based on multi-omics data from small samples. The workflow of this method includes clustering genes using a protein-protein interaction network and gene expression data, screening gene clusters with group lasso, obtaining SNP clusters corresponding to the selected gene clusters through expression quantitative trait locus data, integrating SNP clusters and corresponding gene clusters and phenotypes into three-layer network blocks, analyzing and predicting based on each block, and obtaining the final prediction by taking the average. Conclusions We compare this method to others using two datasets and find that our method shows better results in both cases. Our method can effectively solve the prediction problem in multi-omics data of small sample, and provide valuable resources for further studies on the fusion of more omics data.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/a88843c2b17d435b960cbd0d213b434f Zobrazit plný text záznamu View record in DOAJ Plný text ve formátu PDF