Strategies for analyzing highly enriched IP-chip datasets

Autor:	Simon Tavaré, Oscar M. Aparicio, Christopher J. Viggiani, Simon R.V. Knott
Jazyk:	angličtina
Rok vydání:	2009
Předmět:	Normalization (statistics) Chromatin Immunoprecipitation Population Information Storage and Retrieval Computational biology Biology Bioinformatics lcsh:Computer applications to medicine. Medical informatics Biochemistry 03 medical and health sciences 0302 clinical medicine Structural Biology education Molecular Biology lcsh:QH301-705.5 030304 developmental biology 0303 health sciences education.field_of_study Tiling array Applied Mathematics Methodology Article Gene Expression Profiling Computational Biology Chip Computer Science Applications Dynamic programming lcsh:Biology (General) A priori and a posteriori lcsh:R858-859.7 DNA microarray Chromatin immunoprecipitation 030217 neurology & neurosurgery
Zdroj:	BMC Bioinformatics, Vol 10, Iss 1, p 305 (2009) BMC Bioinformatics
ISSN:	1471-2105
Popis:	Background Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes whose signals vary considerably (due to heterogeneity in the cell population), and this makes their normalization and downstream analysis difficult. Results Here we present strategies for analyzing such experiments, focusing our discussion on the analysis of Bromodeoxyruridine (BrdU) immunoprecipitation on tiling array (BrdU-IP-chip) datasets. BrdU-IP-chip experiments map large, recently replicated genomic regions and have similar characteristics to histone modification/location data. To prepare such data for downstream analysis we employ a dynamic programming algorithm that identifies a set of putative unenriched probes, which we use for both within-array and between-array normalization. We also introduce a second dynamic programming algorithm that incorporates a priori knowledge to identify and quantify positive signals in these datasets. Conclusion Highly enriched IP-chip datasets are often difficult to analyze with traditional array normalization and analysis strategies. Here we present and test a set of analytical tools for their normalization and quantification that allows for accurate identification and analysis of enriched regions.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ffb9c57bd36cf49982627f00879b11ae http://www.biomedcentral.com/1471-2105/10/305 Zobrazit plný text záznamu Full text from SpringerLink