SweGen: a whole-genome data resource of genetic variability in a cross-section of the Swedish population.

Autor: Ameur A; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.; National Genomics Infrastructure, Science for Life Laboratory, Sweden., Dahlberg J; National Genomics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Medical Sciences, Molecular Medicine, Uppsala University, Uppsala, Sweden., Olason P; Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden., Vezzi F; National Genomics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Biochemistry and Biophysics (DBB), Stockholm University, Stockholm, Sweden., Karlsson R; Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden., Martin M; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Biochemistry and Biophysics (DBB), Stockholm University, Stockholm, Sweden., Viklund J; Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden., Kähäri AK; Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden., Lundin P; Science for Life Laboratory, Department of Biochemistry and Biophysics (DBB), Stockholm University, Stockholm, Sweden., Che H; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden., Thutkawkorapin J; Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden., Eisfeldt J; Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden., Lampa S; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden.; Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden., Dahlberg M; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Biochemistry and Biophysics (DBB), Stockholm University, Stockholm, Sweden., Hagberg J; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Biochemistry and Biophysics (DBB), Stockholm University, Stockholm, Sweden., Jareborg N; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Biochemistry and Biophysics (DBB), Stockholm University, Stockholm, Sweden., Liljedahl U; National Genomics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Medical Sciences, Molecular Medicine, Uppsala University, Uppsala, Sweden., Jonasson I; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.; National Genomics Infrastructure, Science for Life Laboratory, Sweden., Johansson Å; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden., Feuk L; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden., Lundeberg J; National Genomics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, School of Biotechnology, Division of Gene Technology, Royal Institute of Technology, Stockholm, Sweden., Syvänen AC; National Genomics Infrastructure, Science for Life Laboratory, Sweden.; Science for Life Laboratory, Department of Medical Sciences, Molecular Medicine, Uppsala University, Uppsala, Sweden., Lundin S; Science for Life Laboratory, School of Biotechnology, Division of Gene Technology, Royal Institute of Technology, Stockholm, Sweden., Nilsson D; Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden., Nystedt B; Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.; National Bioinformatics Infrastructure, Science for Life Laboratory, Sweden., Magnusson PK; Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden., Gyllensten U; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.; National Genomics Infrastructure, Science for Life Laboratory, Sweden.
Jazyk: angličtina
Zdroj: European journal of human genetics : EJHG [Eur J Hum Genet] 2017 Nov; Vol. 25 (11), pp. 1253-1260. Date of Electronic Publication: 2017 Aug 23.
DOI: 10.1038/ejhg.2017.130
Abstrakt: Here we describe the SweGen data set, a comprehensive map of genetic variation in the Swedish population. These data represent a basic resource for clinical genetics laboratories as well as for sequencing-based association studies by providing information on genetic variant frequencies in a cohort that is well matched to national patient cohorts. To select samples for this study, we first examined the genetic structure of the Swedish population using high-density SNP-array data from a nation-wide cohort of over 10 000 Swedish-born individuals included in the Swedish Twin Registry. A total of 1000 individuals, reflecting a cross-section of the population and capturing the main genetic structure, were selected for whole-genome sequencing. Analysis pipelines were developed for automated alignment, variant calling and quality control of the sequencing data. This resulted in a genome-wide collection of aggregated variant frequencies in the Swedish population that we have made available to the scientific community through the website https://swefreq.nbis.se. A total of 29.2 million single-nucleotide variants and 3.8 million indels were detected in the 1000 samples, with 9.9 million of these variants not present in current databases. Each sample contributed with an average of 7199 individual-specific variants. In addition, an average of 8645 larger structural variants (SVs) were detected per individual, and we demonstrate that the population frequencies of these SVs can be used for efficient filtering analyses. Finally, our results show that the genetic diversity within Sweden is substantial compared with the diversity among continental European populations, underscoring the relevance of establishing a local reference data set.
Databáze: MEDLINE