Pan-Cancer Analysis of TCGA Data Revealed Promising Reference Genes for qPCR Normalization
Autor: | Nataliya V. Melnikova, A. D. Beniaminov, Valentina A. Lakunina, Anna V. Kudryavtseva, Anastasiya V. Snezhkina, Alexey A. Dmitriev, George S. Krasnov |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2019 |
Předmět: |
0301 basic medicine
Mutation rate lcsh:QH426-470 PUM1 Pseudogene RNA-Seq reference genes Computational biology Biology 03 medical and health sciences 0302 clinical medicine Reference genes Gene expression Genetics cancer Gene Genetics (clinical) Original Research data normalization CrossHub TCGA 3. Good health lcsh:Genetics 030104 developmental biology Real-time polymerase chain reaction 030220 oncology & carcinogenesis quantitative PCR gene expression Molecular Medicine |
Zdroj: | Frontiers in Genetics, Vol 10 (2019) Frontiers in Genetics |
ISSN: | 1664-8021 |
Popis: | Quantitative PCR (qPCR) remains the most widely used technique for gene expression evaluation. Obtaining reliable data using this method requires reference genes (RGs) with stable mRNA level under experimental conditions. This issue is especially crucial in cancer studies because each tumor has a unique molecular portrait. The Cancer Genome Atlas (TCGA) project provides RNA-Seq data for thousands of samples corresponding to dozens of cancers and presents the basis for assessment of the suitability of genes as reference ones for qPCR data normalization. Using TCGA RNA-Seq data and previously developed CrossHub tool, we evaluated mRNA level of 32 traditionally used RGs in 12 cancer types, including those of lung, breast, prostate, kidney, and colon. We developed an 11-component scoring system for the assessment of gene expression stability. Among the 32 genes, PUM1 was one of the most stably expressed in the majority of examined cancers, whereas GAPDH, which is widely used as a RG, showed significant mRNA level alterations in more than a half of cases. For each of 12 cancer types, we suggested a pair of genes that are the most suitable for use as reference ones. These genes are characterized by high expression stability and absence of correlation between their mRNA levels. Next, the scoring system was expanded with several features of a gene: mutation rate, number of transcript isoforms and pseudogenes, participation in cancer-related processes on the basis of Gene Ontology, and mentions in PubMed-indexed articles. All the genes covered by RNA-Seq data in TCGA were analyzed using the expanded scoring system that allowed us to reveal novel promising RGs for each examined cancer type and identify several "universal" pan-cancer RG candidates, including SF3A1, CIAO1, and SFRS4. The choice of RGs is the basis for precise gene expression evaluation by qPCR. Here, we suggested optimal pairs of traditionally used RGs for 12 cancer types and identified novel promising RGs that demonstrate high expression stability and other features of reliable and convenient RGs (high expression level, low mutation rate, non-involvement in cancer-related processes, single transcript isoform, and absence of pseudogenes). |
Databáze: | OpenAIRE |
Externí odkaz: |