A revamped rat reference genome improves the discovery of genetic diversity in laboratory rats

Authors: Tristan V de Jong, Yanchao Pan, Pasi Rastas, Daniel Munro, Monika Tutaj, Huda Akil, Chris Benner, Apurva S Chitre, William Chow, Vincenza Colonna, Clifton L Dalgard, Wendy M Demos, Peter A Doris, Erik Garrison, Aron Geurts, Hakan M Gunturkun, Victor Guryev, Thibaut Hourlier, Kerstin Howe, Jun Huang, Ted Kalbfleisch, Panjun Kim, Ling Li, Spencer Mahaffey, Fergal J Martin, Pejman Mohammadi, Ayse Bilge Ozel, Oksana Polesskaya, Michal Pravenec, Pjotr Prins, Jonathan Sebat, Jennifer R Smith, Leah C Solberg Woods, Boris Tabakoff, Alan Tracey, Marcela Uliano-Silva, Flavia Villani, Hongyang Wang, Burt M Sharp, Francesca Telese, Zhihua Jiang, Laura Saba, Xusheng Wang, Terence D Murphy, Abraham A Palmer, Anne E Kwitek, Melinda (Mindy) R Dwinell, Robert W Williams, Jun Z Li, Hao Chen
Year of publication: 2023
Description: For over a decade, a large research community has relied on a flawed reference assembly of the genome of Rattus norvegicus known as Rnor_6.0. The seventh assembly of the rat reference genome, mRatBN7.2, based on the inbred Brown Norway rat, corrects numerous misplaced segments, reduces base-level errors by approximately 9-fold, and increases contiguity by 290-fold, despite some remaining regions of potential misassembly. Gene annotations are now more complete, significantly improving the mapping precision of genomic, transcriptomic, and proteomics data sets. Simple LiftOver from Rnor_6.0 to mRatBN7.2 misses ~12% of variants. To facilitate the transition to mRatBN7.2, we performed a joint analysis of 163 whole genomes representing 120 strains/substrains. We defined 20.0 million sequence variations, of which 18.7 thousand are predicted to potentially impact the function of 6,677 genes. Phylogenetic analysis confirmed historical records and prior results and refined the ancestral relationship of these strains. Sixteen million polymorphisms segregate in the widely studied heterogeneous stock rat population, and 11 to 13 million variants segregate collectively in the HXB/BXH and FXLE/LEXF strain families. Some inbred strains differ by only 1 to 2 million variants, and closely related substrains differ by even fewer. We generated a new rat genetic map based on data from 1,893 heterogeneous stock rats and annotated transcription start sites and alternative polyadenylation sites.
Database: OpenAIRE