Tree-Based Co-Clustering Identifies Chromatin Accessibility Patterns Associated With Hematopoietic Lineage Structure.

Authors: George TB, Strawn NK, Leviyang S; all at the Department of Mathematics and Statistics, Georgetown University, Washington, DC, United States.
Language: English
Source: Frontiers in genetics [Front Genet] 2021 Oct 01; Vol. 12, pp. 707117. Date of Electronic Publication: 2021 Oct 01 (Print Publication: 2021).
DOI: 10.3389/fgene.2021.707117
Abstract: Chromatin accessibility, as measured by ATACseq, varies between hematopoietic cell types in different lineages of the hematopoietic differentiation tree, e.g., T cells vs. B cells, but methods that associate variation in chromatin accessibility with the lineage structure of the differentiation tree are lacking. Using an ATACseq dataset recently published by the ImmGen consortium, we construct associations between chromatin accessibility and hematopoietic cell types using a novel co-clustering approach that accounts for the structure of the hematopoietic differentiation tree. Under a model in which all loci and cell types within a co-cluster have a shared accessibility state, we show that roughly 80% of cell-type-associated accessibility variation can be captured through 12 cell type clusters and 20 genomic locus clusters, with the cell type clusters reflecting coherent components of the differentiation tree. Using publicly available ChIPseq datasets, we show that our clustering reflects transcription factor binding patterns with implications for regulation across cell types. We show that traditional methods such as hierarchical and k-means clustering lead to cell type clusters that are more dispersed on the tree than those produced by our tree-based algorithm. We provide a Python package, chromcocluster, that implements the algorithms presented.
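Illustrative note: the shared-state model in the abstract can be pictured as replacing every entry of a loci-by-cell-types accessibility matrix with the mean of its co-cluster block, then asking how much of the total variation that block-constant approximation retains. The following is a minimal sketch under that assumption only. It does not use the chromcocluster API, it ignores the differentiation tree, and the paper's own definition of "cell-type-associated variation" may differ. The function names `block_means` and `fraction_captured` and the toy matrix are hypothetical.

```python
from collections import defaultdict


def block_means(matrix, locus_labels, cell_labels):
    """Mean accessibility of each (locus cluster, cell type cluster) block."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            key = (locus_labels[i], cell_labels[j])
            sums[key] += value
            counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def fraction_captured(matrix, locus_labels, cell_labels):
    """Share of total squared variation retained by the block-mean approximation.

    Assumes the matrix is not constant (otherwise total variation is zero).
    """
    means = block_means(matrix, locus_labels, cell_labels)
    values = [value for row in matrix for value in row]
    grand_mean = sum(values) / len(values)
    total = sum((value - grand_mean) ** 2 for value in values)
    residual = sum(
        (value - means[(locus_labels[i], cell_labels[j])]) ** 2
        for i, row in enumerate(matrix)
        for j, value in enumerate(row)
    )
    return 1.0 - residual / total


# Hypothetical toy data: 4 loci (rows) by 4 cell types (columns),
# with two locus clusters and two cell type clusters.
toy = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
perfect = fraction_captured(toy, [0, 0, 1, 1], [0, 0, 1, 1])
single = fraction_captured(toy, [0, 0, 0, 0], [0, 0, 0, 0])
```

In this toy case the two-by-two co-clustering reproduces the matrix exactly, while collapsing everything into a single co-cluster retains none of the variation. A tree-based method of the kind the abstract describes would additionally constrain which cell types may be grouped together.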
Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
(Copyright © 2021 George, Strawn and Leviyang.)
Database: MEDLINE