Genome maps across 26 human populations reveal population-specific patterns of structural variation
Autor: | Claire Yik Lok Chung, Ahmed Naguib, Alex Hastie, Walfred Ma, Michal Levy-Sakin, Ramakrishnan Rajagopalan, Alden King-Yung Leung, Justin Sibert, Steven Pastor, Eugene Y. C. Chow, Jennifer McCaffrey, Karen H. Y. Wong, Ming Xiao, Annie Poon, Chin Lin, Han Cao, Pui-Yan Kwok, Kevin Y. Yip, Catherine J. Chu, Eleanor Young, Wei-Ping Wang, Yulia Mostovoy, Nana Jin, Ting-Fung Chan, Ernest T. Lam, Le Li |
---|---|
Rok vydání: | 2019 |
Předmět: |
Male
0301 basic medicine Genetic Linkage Gene Dosage General Physics and Astronomy 02 engineering and technology Genome Segmental Duplications Genomic lcsh:Science Phylogeny Segmental duplication education.field_of_study Multidisciplinary Chromosome Mapping Genomics 021001 nanoscience & nanotechnology Segmental Duplications Female 0210 nano-technology Sequence Analysis Algorithms Human Science Genomic Structural Variation Population Biology Article Chromosomes General Biochemistry Genetics and Molecular Biology Structural variation 03 medical and health sciences Genetics Humans 1000 Genomes Project education Chromosomes Human Y Base Sequence Genome Human Human Genome Computational Biology Sequence Analysis DNA DNA General Chemistry 030104 developmental biology Evolutionary biology Mutation Genomic lcsh:Q Human genome |
Zdroj: | Nature communications, vol 10, iss 1 Nature Communications, Vol 10, Iss 1, Pp 1-14 (2019) Nature Communications |
ISSN: | 2041-1723 |
Popis: | Large structural variants (SVs) in the human genome are difficult to detect and study by conventional sequencing technologies. With long-range genome analysis platforms, such as optical mapping, one can identify large SVs (>2 kb) across the genome in one experiment. Analyzing optical genome maps of 154 individuals from the 26 populations sequenced in the 1000 Genomes Project, we find that phylogenetic population patterns of large SVs are similar to those of single nucleotide variations in 86% of the human genome, while ~2% of the genome has high structural complexity. We are able to characterize SVs in many intractable regions of the genome, including segmental duplications and subtelomeric, pericentromeric, and acrocentric areas. In addition, we discover ~60 Mb of non-redundant genome content missing in the reference genome sequence assembly. Our results highlight the need for a comprehensive set of alternate haplotypes from different populations to represent SV patterns in the genome. Large structural variants (SV) are understudied in human genetics research because of the difficulty to detect them in the routinely generated short-read sequencing data. Here, the authors generate optical genome maps of 154 individuals from 26 populations that allow comprehensive examination of large SVs. |
Databáze: | OpenAIRE |
Externí odkaz: |