Authors: |
Jamshed Khan, Marek Kokot, Sebastian Deorowicz, Rob Patro |
Language: |
English |
Publication year: |
2022 |
Subject: |
|
Source: |
Genome Biology, Vol 23, Iss 1, Pp 1-32 (2022) |
Document type: |
article |
ISSN: |
1474-760X |
DOI: |
10.1186/s13059-022-02743-6 |
Description: |
Abstract: The de Bruijn graph is a key data structure in modern computational genomics, and construction of its compacted variant resides upstream of many genomic analyses. As the quantity of genomic data grows rapidly, this construction often forms a computational bottleneck. We present Cuttlefish 2, which significantly advances the state of the art for this problem. On a commodity server, it reduces the graph construction time for 661K bacterial genomes, of size 2.58 Tbp, from 4.5 days to 17–23 h; and it constructs the graph for 1.52 Tbp of white spruce reads in approximately 10 h, while the closest competitor requires 54–58 h, using considerably more memory. |
Database: |
Directory of Open Access Journals |
External link: |
|