Quality assessment and control of tissue specific RNA-seq libraries of Drosophila transgenic RNAi models

Autor: Takakazu Yokokura, Seiko Yoshikawa, Andreia J. Amaral, Margarida Gama-Carvalho, Francisco Brito, Tamar Chobanyan, David Van Vactor
Přispěvatelé: Repositório da Universidade de Lisboa
Jazyk: angličtina
Rok vydání: 2014
Předmět:
Zdroj: Frontiers in Genetics, Vol 5 (2014)
Repositório Científico de Acesso Aberto de Portugal
Repositório Científico de Acesso Aberto de Portugal (RCAAP)
instacron:RCAAP
Frontiers in Genetics
ISSN: 1664-8021
DOI: 10.3389/fgene.2014.00043
Popis: Copyright © 2014 Amaral, Brito, Chobanyan, Yoshikawa, Yokokura, Van Vactor and Gama-Carvalho. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
RNA-sequencing (RNA-seq) is rapidly emerging as the technology of choice for whole-transcriptome studies. However, RNA-seq is not a bias free technique. It requires large amounts of RNA and library preparation can introduce multiple artifacts, compounded by problems from later stages in the process. Nevertheless, RNA-seq is increasingly used in multiple studies, including the characterization of tissue-specific transcriptomes from invertebrate models of human disease. The generation of samples in this context is complex, involving the establishment of mutant strains and the delicate contamination prone process of dissecting the target tissue. Moreover, in order to achieve the required amount of RNA, multiple samples need to be pooled. Such datasets pose extra challenges due to the large variability that may occur between similar pools, mostly due to the presence of cells from surrounding tissues. Therefore, in addition to standard quality control of RNA-seq data, analytical procedures for control of "biological quality" are critical for successful comparison of gene expression profiles. In this study, the transcriptome of the central nervous system (CNS) of a Drosophila transgenic strain with neuronal-specific RNAi of an ubiquitous gene was profiled using RNA-seq. After observing the existence of an unusual variance in our dataset, we showed that the expression profile of a small panel of marker genes, including GAL4 under control of a tissue specific driver, can identify libraries with low levels of contamination from neighboring tissues, enabling the selection of a robust dataset for differential expression analysis. We further analyzed the potential of profiling a complex tissue to identify cell-type specific changes in response to target gene down-regulation. Finally, we showed that trimming 5' ends of reads decreases nucleotide frequency biases, increasing the coverage of protein coding genes with a potential positive impact in the incurrence of systematic technical errors.
Andreia J. Amaral was supported by a Marie Curie European Integration Grant (PERG-GA-2009-256595), an FCT post-doctoral grant (SFRH/BPD/65976/2009), and an EMBO long term fellowship (ALTF33-2010). Andreia J. Amaral, Francisco F. Brito, and Margarida Gama-Carvalho were further supported by FCT—PEst-OE/BIA/UI4046/2011 funds. Tamar Chobanyan, Seiko Yoshikawa, and Takakazu Yokokura were supported by OIST; David Van Vactor was also supported by grants from NINDS.
Databáze: OpenAIRE