Rearrangement of incomplete multi-omics datasets combined with ComDim for evaluating replicate cross-platform variability and batch influence
Authors: | Olivier Chapleur, Delphine Jouan-Rimbaud Bouveresse, Douglas N. Rutledge, Francesc Puig-Castellví, Laurent Mazéas |
---|---|
Contributors: | Procédés biotechnologiques au service de l'environnement (UR PROSE), Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE) |
Language: | English |
Publication year: | 2021 |
Subject: | 0303 health sciences; Computer science; Process Chemistry and Technology; Replicate; computer.software_genre; Data type; Computer Science Applications; Analytical Chemistry; 03 medical and health sciences; 0302 clinical medicine; [CHIM.ANAL]Chemical Sciences/Analytical chemistry; Component (UML); Cross-platform; Multi omics; DECIPHER; Data mining; computer; 030217 neurology & neurosurgery; Spectroscopy; Software; ComputingMilieux_MISCELLANEOUS; 030304 developmental biology; Block (data storage) |
Source: | Chemometrics and Intelligent Laboratory Systems, Elsevier, 2021, 218, pp.104422. ⟨10.1016/j.chemolab.2021.104422⟩ |
ISSN: | 0169-7439 |
DOI: | 10.1016/j.chemolab.2021.104422 |
Description: | Multi-omics studies can highlight the interrelationships among data across different layers of biological information. However, methods for the unsupervised analysis of multi-block data do not take the individual variability across batches into account and cannot deal with omics datasets when they present different numbers of replicates. We have explored three different data arrangement strategies to tackle these limitations. Several multi-block methods can be used to decipher the common variations across blocks and to determine the contribution of each block to each common component. In this study, the ComDim method was used to compare these rearrangement strategies for three multi-omics datasets. We found that arranging the data using the ‘replicate by blocks’ strategy, where each block comprises data from only one replicate independently of its data type, provided the most insightful results. ComDim allowed the evaluation of the variability across the replicate blocks, confirming the existence of batch effects in some of the studies. Moreover, since the contributions of these batch effects were separated from the other contributions, the coordinated biological responses common across the different blocks were characterized for each data type. |
Database: | OpenAIRE |