ABI Base Recall: Automatic Correction and Ends Trimming of DNA Sequences
Autor: | Loubna El Yazouli, Fouzia Radouani, Zakaria Elyazghi, Khalid Sadki |
---|---|
Rok vydání: | 2017 |
Předmět: |
0301 basic medicine
Computer science Interface (computing) Biomedical Engineering Pharmaceutical Science Medicine (miscellaneous) Bioengineering DNA sequencing Set (abstract data type) 03 medical and health sciences Software Electrical and Electronic Engineering Automation Laboratory Internet business.industry Computational Biology DNA Sequence Analysis DNA Base (topology) Computer Science Applications DNA sequencer 030104 developmental biology Trimming business Clipping (computer graphics) Algorithm Sequence Alignment Biotechnology |
Zdroj: | IEEE transactions on nanobioscience. 16(8) |
ISSN: | 1558-2639 |
Popis: | Automated DNA sequencers produce chromatogram files in ABI format. When viewing chromatograms, some ambiguities are shown at various sites along the DNA sequences, because the program implemented in the sequencing machine and used to call bases cannot always precisely determine the right nucleotide, especially when it is represented by either a broad peak or a set of overlaying peaks. In such cases, a letter other than A, C, G, or T is recorded, most commonly N. Thus, DNA sequencing chromatograms need manual examination: checking for mis-calls and truncating the sequence when errors become too frequent. The purpose of this paper is to develop a program allowing the automatic correction of these ambiguities. This application is a Web-based program powered by Shiny and runs under R platform for an easy exploitation. As a part of the interface, we added the automatic ends clipping option, alignment against reference sequences, and BLAST. To develop and test our tool, we collected several bacterial DNA sequences from different laboratories within Institut Pasteur du Maroc and performed both manual and automatic correction. The comparison between the two methods was carried out. As a result, we note that our program, ABI base recall, accomplishes good correction with a high accuracy. Indeed, it increases the rate of identity and coverage and minimizes the number of mismatches and gaps, hence it provides solution to sequencing ambiguities and saves biologists’ time and labor. |
Databáze: | OpenAIRE |
Externí odkaz: |