Transcript- and annotation-guided genome assembly of the European starling
Autor: | Simone Meddle, William B. Sherwin, Phillip Cassey, Matthew C. Brandley, Scott J. Werner, Melissa Bateson, Katarina C. Stuart, Lee A. Rollins, D. F. Clayton, Tim De Meyer, Richard Edwards, Katherine L. Buchanan, Gregory F. Ball, Natalie R. Hofmeister, David W. Burt, Yuanyuan Cheng, Wesley C. Warren |
---|---|
Rok vydání: | 2021 |
Předmět: |
Sturnus vulgaris
genome annotation assessment Population Sequence assembly Genomics Computational biology Biology full-length transcripts Genome Transcriptome Genetics Animals education genome Zebra finch Ecology Evolution Behavior and Systematics education.field_of_study Australia Biology and Life Sciences Molecular Sequence Annotation Gene Annotation STURNUS-VULGARIS PAN-GENOME genome assessment EVOLUTION INSIGHTS DISCOVERY Starlings POPULATION TRENDS genome assembly VISUALIZATION Nanopore sequencing Biotechnology |
Zdroj: | MOLECULAR ECOLOGY RESOURCES Stuart, K, Edwards, R, Cheng, Y, Warren, W, Burt, D, Sherwin, W, Hofmeister, N, Werner, S, Ball, G F, Bateson, M, Brandley, M, Buchanan, K L, Cassey, P, Clayton, D F, De Meyer, T, Meddle, S & Rollins, L 2022, ' Transcript-and annotation-guided genome assembly of the European starling ', Molecular Ecology Resources, vol. 22, no. 8, pp. 3141-3160 . https://doi.org/10.1111/1755-0998.13679 |
ISSN: | 1755-098X 1755-0998 |
Popis: | The European starling, Sturnus vulgaris, is an ecologically significant, globally invasive avian species that is also suffering from a major decline in its native range. Here, we present the genome assembly and long-read transcriptome of an Australian-sourced European starling (S. vulgaris vAU), and a second North American genome (S. vulgaris vNA), as complementary reference genomes for population genetic and evolutionary characterisation. S. vulgaris vAU combined 10x Genomics linked-reads, low-coverage Nanopore sequencing, and PacBio Iso-Seq full-length transcript scaffolding to generate a 1050 Mb assembly on 1,628 scaffolds (72.5 Mb scaffold N50). Species-specific transcript mapping and gene annotation revealed high structural and functional completeness (94.6% BUSCO completeness). Further scaffolding against the high-quality zebra finch (Taeniopygia guttata) genome assigned 98.6% of the assembly to 32 putative nuclear chromosome scaffolds. Rapid, recent advances in sequencing technologies and bioinformatics software have highlighted the need for evidence-based assessment of assembly decisions on a case-by-case basis. Using S. vulgaris vAU, we demonstrate how the multifunctional use of PacBio Iso-Seq transcript data and complementary homology-based annotation of sequential assembly steps (assessed using a new tool, SAAGA) can be used to assess, inform, and validate assembly workflow decisions. We also highlight some counter-intuitive behaviour in traditional BUSCO metrics, and present Buscomp, a complementary tool for assembly comparison designed to be robust to differences in assembly size and base-calling quality. Finally, we present a second starling assembly, S. vulgaris vNA, to facilitate comparative analysis and global genomic research on this ecologically important species. |
Databáze: | OpenAIRE |
Externí odkaz: |