PacRAT: a program to improve barcode-variant mapping from PacBio long reads using multiple sequence alignment

Autor: Chiann-Ling C. Yeh, Maitreya J. Dunham, Soyeon Showman, Clara J. Amorosi
Rok vydání: 2021
Předmět:
Zdroj: Bioinformatics
ISSN: 1367-4811
Popis: Summary Use of PacBio sequencing for characterizing barcoded libraries of genetic variants is on the rise. However, current approaches in resolving PacBio sequencing artifacts can result in a high number of incorrectly identified or unusable reads. Here, we developed a PacBio Read Alignment Tool (PacRAT) that improves the accuracy of barcode-variant mapping through several steps of read alignment and consensus calling. To quantify the performance of our approach, we simulated PacBio reads from eight variant libraries of various lengths and showed that PacRAT improves the accuracy in pairing barcodes and variants across these libraries. Analysis of real (non-simulated) libraries also showed an increase in the number of reads that can be used for downstream analyses when using PacRAT. Availability and implementation PacRAT is written in Python and is freely available (https://github.com/dunhamlab/PacRAT). Supplementary information Supplemental data are available at Bioinformatics online.
Databáze: OpenAIRE