MapReduce Algorithms for Inferring Gene Regulatory Networks from Time-Series Microarray Data Using an Information-Theoretic Approach

Autor: Yasser Abduallah, Miguel Cervantes-Cervantes, Turki Turki, Zongxuan Du, Jason T. L. Wang, Kevin Byron
Rok vydání: 2017
Předmět:
0301 basic medicine
FOS: Computer and information sciences
Time Factors
Article Subject
Computer science
Molecular Networks (q-bio.MN)
Gene regulatory network
Information Theory
lcsh:Medicine
Cloud computing
Saccharomyces cerevisiae
General Biochemistry
Genetics and Molecular Biology

Regulatory molecules
Computational Engineering
Finance
and Science (cs.CE)

03 medical and health sciences
0302 clinical medicine
Quantitative Biology - Molecular Networks
Quantitative Biology - Genomics
Gene Regulatory Networks
Computer Science - Computational Engineering
Finance
and Science

Oligonucleotide Array Sequence Analysis
Regulation of gene expression
Genomics (q-bio.GN)
Data processing
General Immunology and Microbiology
Series (mathematics)
Microarray analysis techniques
business.industry
lcsh:R
Experimental data
General Medicine
030104 developmental biology
030220 oncology & carcinogenesis
FOS: Biological sciences
ComputingMethodologies_GENERAL
business
Algorithm
Algorithms
Research Article
Zdroj: BioMed Research International
BioMed Research International, Vol 2017 (2017)
DOI: 10.48550/arxiv.1704.06548
Popis: Gene regulation is a series of processes that control gene expression and its extent. The connections among genes and their regulatory molecules, usually transcription factors, and a descriptive model of such connections, are known as gene regulatory networks (GRNs). Elucidating GRNs is crucial to understand the inner workings of the cell and the complexity of gene interactions. To date, numerous algorithms have been developed to infer gene regulatory networks. However, as the number of identified genes increases and the complexity of their interactions is uncovered, networks and their regulatory mechanisms become cumbersome to test. Furthermore, prodding through experimental results requires an enormous amount of computation, resulting in slow data processing. Therefore, new approaches are needed to expeditiously analyze copious amounts of experimental data resulting from cellular GRNs. To meet this need, cloud computing is promising as reported in the literature. Here we propose new MapReduce algorithms for inferring gene regulatory networks on a Hadoop cluster in a cloud environment. These algorithms employ an information-theoretic approach to infer GRNs using time-series microarray data. Experimental results show that our MapReduce program is much faster than an existing tool while achieving slightly better prediction accuracy than the existing tool.
Comment: 19 pages, 5 figures
Databáze: OpenAIRE