CENTRE: a gradient boosting algorithm for Cell-type-specific ENhancer-Target pREdiction.

Authors: Rapakoulia T; Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany., Lopez Ruiz De Vargas S; Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany., Omgba PA; Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany., Laupert V; Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany., Ulitsky I; Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.; Department of Immunology and Regenerative Biology, Weizmann Institute of Science, Rehovot 76100, Israel.; Department of Molecular Neuroscience, Weizmann Institute of Science, Rehovot 76100, Israel., Vingron M; Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.
Language: English
Source: Bioinformatics (Oxford, England) [Bioinformatics] 2023 Nov 01; Vol. 39 (11).
DOI: 10.1093/bioinformatics/btad687
Abstract: Motivation: Identifying the target promoters of active enhancers is a crucial step for understanding gene regulation and deciphering phenotypes and diseases. To date, several computational methods have been developed to predict enhancer-gene interactions, but they require either many epigenomic and transcriptomic experimental assays to generate cell-type (CT)-specific predictions or a single experiment applied to a large cohort of CTs to extract correlations between the activities of regulatory elements. Thus, inferring CT-specific enhancer-gene interactions in unstudied or poorly annotated CTs is a laborious and costly task.
Results: Here, we aim to infer CT-specific enhancer-target interactions using minimal experimental input. We introduce Cell-type-specific ENhancer-Target pREdiction (CENTRE), a machine learning framework that predicts enhancer-target interactions in a CT-specific manner, using only gene expression and ChIP-seq data for three histone modifications for the CT of interest. CENTRE exploits the wealth of available datasets and extracts CT-agnostic statistics to complement the CT-specific information. CENTRE is thoroughly tested across many datasets and CTs and achieves performance equivalent or superior to that of existing algorithms that require massive experimental data.
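Illustration (not part of the published abstract): the idea of pairing CT-specific inputs with CT-agnostic statistics can be sketched as a feature vector built per enhancer-gene pair, which a gradient boosting classifier would then score. This is a minimal sketch under stated assumptions, not CENTRE's actual code: the mark names, the `CellTypeData` container, the use of promoter-side signals, and the two agnostic statistics are all hypothetical placeholders, since the abstract does not name them.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Placeholder names: the abstract says "three histone modifications"
# but does not specify which ones.
HISTONE_MARKS = ("mark_a", "mark_b", "mark_c")

# Assumed number of cell-type-agnostic statistics per enhancer-gene pair.
N_AGNOSTIC = 2


@dataclass
class CellTypeData:
    """Hypothetical container for the CT-specific inputs."""
    expression: Dict[str, float]                   # gene id -> expression level
    enhancer_signal: Dict[str, Dict[str, float]]   # mark -> enhancer id -> ChIP-seq signal
    promoter_signal: Dict[str, Dict[str, float]]   # mark -> gene id -> ChIP-seq signal


def build_feature_vector(
    enhancer_id: str,
    gene_id: str,
    ct: CellTypeData,
    agnostic_stats: Dict[Tuple[str, str], List[float]],
) -> List[float]:
    """Concatenate CT-specific and CT-agnostic features for one pair."""
    # CT-specific part: gene expression, then per-mark signals.
    features = [ct.expression.get(gene_id, 0.0)]
    for mark in HISTONE_MARKS:
        features.append(ct.enhancer_signal[mark].get(enhancer_id, 0.0))
        features.append(ct.promoter_signal[mark].get(gene_id, 0.0))
    # CT-agnostic part: statistics precomputed from public datasets;
    # missing pairs fall back to zeros.
    features.extend(agnostic_stats.get((enhancer_id, gene_id), [0.0] * N_AGNOSTIC))
    return features


# Toy data, invented purely for illustration.
ct = CellTypeData(
    expression={"geneA": 5.0},
    enhancer_signal={m: {"enh1": 1.0} for m in HISTONE_MARKS},
    promoter_signal={m: {"geneA": 2.0} for m in HISTONE_MARKS},
)
agnostic = {("enh1", "geneA"): [0.8, 0.1]}

example_vector = build_feature_vector("enh1", "geneA", ct, agnostic)
missing_vector = build_feature_vector("enh2", "geneB", ct, agnostic)
```

Vectors like `example_vector`, stacked over all candidate pairs, would form the input matrix for any gradient boosting library; the choice of library and hyperparameters is left open here because the abstract does not state them.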
Availability and Implementation: CENTRE's open-source code is available on GitHub at https://github.com/slrvv/CENTRE.
(© The Author(s) 2023. Published by Oxford University Press.)
Database: MEDLINE