Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution
Autor: | Rita Derua, Mikaela Granvik, Lieven Thorrez, Katleen Lemaire, T. in’t Veld, L. Van Lommel, J. Buyse, S. De Coster, Lotte Goyvaerts, Raf Huttener, Frans Schuit, Etienne Waelkens |
---|---|
Rok vydání: | 2021 |
Předmět: |
Genome evolution
SLC2A4 Evolution ENO3 ALDOA ENCODE Genome PYGM Birds Evolution Molecular Transcript landscapes Missing genes biology.animal QH359-425 Animals Humans Sequencing artifacts Gene QH540-549.5 Phylogeny Ecology biology Genome Human Avian genomes Vertebrate General Medicine Gene Annotation Accelerated Evolutionary biology Vertebrates GLUT4 GC-content Research Article Reference genome |
Zdroj: | BMC Ecology and Evolution BMC Ecology and Evolution, Vol 21, Iss 1, Pp 1-18 (2021) |
ISSN: | 2730-7182 |
DOI: | 10.1186/s12862-021-01905-7 |
Popis: | Background Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we reexamined this question. Results On basis of gene annotation in non-avian vertebrate genomes, we established a list of 15,135 common vertebrate genes. Of these, 1026 were not found in any of eight examined bird genomes. Visualizing regional genome effects by our sliding window approach showed that the majority of these "missing" genes can be clustered to 14 regions of the human reference genome. In these clusters, an additional 1517 genes (often gene fragments) were underrepresented in bird genomes. The clusters of “missing” genes coincided with regions of very high GC content, particularly in avian genomes, making them “hidden” because of incomplete sequencing. Moreover, proteins encoded by genes in these sequencing refractory regions showed signs of accelerated protein evolution. As a proof of principle for this idea we experimentally characterized the mRNA and protein products of four "hidden" bird genes that are crucial for energy homeostasis in skeletal muscle: ALDOA, ENO3, PYGM and SLC2A4. Conclusions A least part of the “missing” genes in bird genomes can be attributed to an artifact caused by the difficulty to sequence regions with extreme GC% (“hidden” genes). Biologically, these “hidden” genes are of interest as they encode proteins that evolve more rapidly than the genome wide average. Finally we show that four of these “hidden” genes encode key proteins for energy metabolism in flight muscle. |
Databáze: | OpenAIRE |
Externí odkaz: |