PacRAT: a program to improve barcode-variant mapping from PacBio long reads using multiple sequence alignment.

Autor: Yeh CC; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA., Amorosi CJ; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA., Showman S; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.; Molecular and Cellular Biology Program, University of Washington, Seattle, WA 98195, USA., Dunham MJ; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
Jazyk: angličtina
Zdroj: Bioinformatics (Oxford, England) [Bioinformatics] 2022 May 13; Vol. 38 (10), pp. 2927-2929.
DOI: 10.1093/bioinformatics/btac165
Abstrakt: Summary: Use of PacBio sequencing for characterizing barcoded libraries of genetic variants is on the rise. However, current approaches in resolving PacBio sequencing artifacts can result in a high number of incorrectly identified or unusable reads. Here, we developed a PacBio Read Alignment Tool (PacRAT) that improves the accuracy of barcode-variant mapping through several steps of read alignment and consensus calling. To quantify the performance of our approach, we simulated PacBio reads from eight variant libraries of various lengths and showed that PacRAT improves the accuracy in pairing barcodes and variants across these libraries. Analysis of real (non-simulated) libraries also showed an increase in the number of reads that can be used for downstream analyses when using PacRAT.
Availability and Implementation: PacRAT is written in Python and is freely available (https://github.com/dunhamlab/PacRAT).
Supplementary Information: Supplemental data are available at Bioinformatics online.
(© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
Databáze: MEDLINE