MUREN: a robust and multi-reference approach of RNA-seq transcript normalization

Autor:	Yance Feng, Lei M. Li
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	RNA-seq Normalization Asymmetrically regulated transcription profiles (ART) Skewness Mode Multi-reference Computer applications to medicine. Medical informatics R858-859.7 Biology (General) QH301-705.5
Zdroj:	BMC Bioinformatics, Vol 22, Iss 1, Pp 1-18 (2021)
Druh dokumentu:	article
ISSN:	1471-2105
DOI:	10.1186/s12859-021-04288-0
Popis:	Abstract Background Normalization of RNA-seq data aims at identifying biological expression differentiation between samples by removing the effects of unwanted confounding factors. Explicitly or implicitly, the justification of normalization requires a set of housekeeping genes. However, the existence of housekeeping genes common for a very large collection of samples, especially under a wide range of conditions, is questionable. Results We propose to carry out pairwise normalization with respect to multiple references, selected from representative samples. Then the pairwise intermediates are integrated based on a linear model that adjusts the reference effects. Motivated by the notion of housekeeping genes and their statistical counterparts, we adopt the robust least trimmed squares regression in pairwise normalization. The proposed method (MUREN) is compared with other existing tools on some standard data sets. The goodness of normalization emphasizes on preserving possible asymmetric differentiation, whose biological significance is exemplified by a single cell data of cell cycle. MUREN is implemented as an R package. The code under license GPL-3 is available on the github platform: github.com/hippo-yf/MUREN and on the conda platform: anaconda.org/hippo-yf/r-muren. Conclusions MUREN performs the RNA-seq normalization using a two-step statistical regression induced from a general principle. We propose that the densities of pairwise differentiations are used to evaluate the goodness of normalization. MUREN adjusts the mode of differentiation toward zero while preserving the skewness due to biological asymmetric differentiation. Moreover, by robustly integrating pre-normalized counts with respect to multiple references, MUREN is immune to individual outlier samples.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/c0f7f7a58be6496bb926c5e20ec87e94 Zobrazit plný text záznamu View record in DOAJ Plný text ve formátu PDF