Joint modeling of an outcome variable and integrated omics datasets using GLM-PO2PLS.

Autor: Gu Z; Department of Data Science and Biostatistics, Julius Centre, UMC Utrecht, Utrecht, The Netherlands.; Medical Research Council Biostatistics Unit, University of Cambridge, Cambridge, UK., Uh HW; Department of Data Science and Biostatistics, Julius Centre, UMC Utrecht, Utrecht, The Netherlands., Houwing-Duistermaat J; Department of Data Science and Biostatistics, Julius Centre, UMC Utrecht, Utrecht, The Netherlands.; Department of Statistics, University of Leeds, Leeds, UK.; Department of Mathematics, Radboud University, Nijmegen, The Netherlands., El Bouhaddani S; Department of Data Science and Biostatistics, Julius Centre, UMC Utrecht, Utrecht, The Netherlands.
Jazyk: angličtina
Zdroj: Journal of applied statistics [J Appl Stat] 2024 Feb 21; Vol. 51 (13), pp. 2627-2651. Date of Electronic Publication: 2024 Feb 21 (Print Publication: 2024).
DOI: 10.1080/02664763.2024.2313458
Abstrakt: In many studies of human diseases, multiple omics datasets are measured. Typically, these omics datasets are studied one by one with the disease, thus the relationship between omics is overlooked. Modeling the joint part of multiple omics and its association to the outcome disease will provide insights into the complex molecular base of the disease. Several dimension reduction methods which jointly model multiple omics and two-stage approaches that model the omics and outcome in separate steps are available. Holistic one-stage models for both omics and outcome are lacking. In this article, we propose a novel one-stage method that jointly models an outcome variable with omics. We establish the model identifiability and develop EM algorithms to obtain maximum likelihood estimators of the parameters for normally and Bernoulli distributed outcomes. Test statistics are proposed to infer the association between the outcome and omics, and their asymptotic distributions are derived. Extensive simulation studies are conducted to evaluate the proposed model. The method is illustrated by modeling Down syndrome as outcome and methylation and glycomics as omics datasets. Here we show that our model provides more insight by jointly considering methylation and glycomics.
Competing Interests: The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
(© 2024 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.)
Databáze: MEDLINE