Dog10K_Boxer_Tasha_1.0: A Long-Read Assembly of the Dog Reference Genome

Authors: Xiangquan Zhang, Christophe Hitte, Elaine A. Ostrander, Patrick Masterson, Terence Murphy, Yan-Hu Liu, Jeffrey M. Kidd, S. Emery, Brian W. Davis, Tosso Leeb, Ya-Ping Zhang, Reuben M. Buckley, Guo-Dong Wang, Vidhya Jagannathan
Contributors: University of Bern, Institut de Génétique et Développement de Rennes (IGDR), Structure Fédérative de Recherche en Biologie et Santé de Rennes (Biosit: Biologie - Santé - Innovation Technologique)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES), University of Michigan [Ann Arbor], University of Michigan System, National Center for Biotechnology Information (NCBI), Texas A&M University [College Station], National Human Genome Research Institute (NHGRI), Kunming Institute of Zoology, Chinese Academy of Sciences [Beijing] (CAS), 2019YFA0707101, The National Key R&D Program of China, R01GM140135, National Institutes of Health, Université de Rennes (UR)-Centre National de la Recherche Scientifique (CNRS)-Structure Fédérative de Recherche en Biologie et Santé de Rennes (Biosit: Biologie - Santé - Innovation Technologique), Kunming Institute of Zoology (KIZ)
Year of publication: 2021
Subject:
Source: Jagannathan, Vidhya; Hitte, Christophe; Kidd, Jeffrey M.; Masterson, Patrick; Murphy, Terence D.; Emery, Sarah; Davis, Brian; Buckley, Reuben M.; Liu, Yan-Hu; Zhang, Xiang-Quan; Leeb, Tosso; Zhang, Ya-Ping; Ostrander, Elaine A.; Wang, Guo-Dong (2021). Dog10K_Boxer_Tasha_1.0: A Long-Read Assembly of the Dog Reference Genome. Genes, 12(6), 847. MDPI, Molecular Diversity Preservation International. 10.3390/genes12060847
Volume 12
Issue 6
ISSN: 2073-4425
DOI: 10.48350/156573
Description: The domestic dog has evolved to be an important biomedical model for studies regarding the genetic basis of disease, morphology and behavior. Genetic studies in the dog have relied on a draft reference genome of a purebred female boxer dog named “Tasha” initially published in 2005. Derived from a Sanger whole genome shotgun sequencing approach coupled with limited clone-based sequencing, the initial assembly and subsequent updates have served as the predominant resource for canine genetics for 15 years. While the initial assembly produced a good-quality draft, as with all assemblies produced at the time, it contained gaps, assembly errors and missing sequences, particularly in GC-rich regions, which are found at many promoters and in the first exons of protein-coding genes. Here, we present Dog10K_Boxer_Tasha_1.0, an improved chromosome-level highly contiguous genome assembly of Tasha created with long-read technologies that increases sequence contiguity >100-fold, closes >23,000 gaps of the CanFam3.1 reference assembly and improves gene annotation by identifying >1200 new protein-coding transcripts. The assembly and annotation are available at NCBI under the accession GCF_000002285.5.
Database: OpenAIRE