LinkedSV for detection of mosaic structural variants from linked-read exome and genome sequencing data

Autor: Sören-Sebastian Wenzel, Kai Wang, Katharina Wimmer, Li Fang, Renata Pellegrino da Silva, Charlly Kao, Hakon Hakonarson, Fernanda Abani Mafra, Mingyao Li, Michael Gonzalez
Jazyk: angličtina
Rok vydání: 2019
Předmět:
Zdroj: Nature Communications, Vol 10, Iss 1, Pp 1-15 (2019)
Nature Communications
ISSN: 2041-1723
DOI: 10.1038/s41467-019-13397-7
Popis: Linked-read sequencing provides long-range information on short-read sequencing data by barcoding reads originating from the same DNA molecule, and can improve detection and breakpoint identification for structural variants (SVs). Here we present LinkedSV for SV detection on linked-read sequencing data. LinkedSV considers barcode overlapping and enriched fragment endpoints as signals to detect large SVs, while it leverages read depth, paired-end signals and local assembly to detect small SVs. Benchmarking studies demonstrate that LinkedSV outperforms existing tools, especially on exome data and on somatic SVs with low variant allele frequencies. We demonstrate clinical cases where LinkedSV identifies disease-causal SVs from linked-read exome sequencing data missed by conventional exome sequencing, and show examples where LinkedSV identifies SVs missed by high-coverage long-read sequencing. In summary, LinkedSV can detect SVs missed by conventional short-read and long-read sequencing approaches, and may resolve negative cases from clinical genome/exome sequencing studies.
Compared to single nucleotide variants and short indels, structural variants (SVs) are often more challenging to detect using high-throughput sequencing based methods. Here, the authors develop LinkedSV, a computational tool for SV detection using linked-read exome and genome sequencing data.
Databáze: OpenAIRE