A space-efficient solution to find the maximum overlap using a compressed suffix array

Autor: Qutaibah M. Malluhi, Mohamed Abouelhoda, Maan Haj Rachid
Rok vydání: 2014
Předmět:
Zdroj: 2nd Middle East Conference on Biomedical Engineering.
DOI: 10.1109/mecbme.2014.6783270
Popis: Compressed indices are important data structures in stringology. Compressed versions of many well-known data structures such as suffix tree and suffix array, which are used in string matching problems, have been studied and proposed. This paper takes advantage of a very recent compressed suffix array to build a space-economic solution for an important bioinformatics problem, namely the all-pairs suffix prefix problem. The paper also presents a simple technique for parallelizing the solution. Our results show that the proposed solution consumes less than one fifth of the space required by other solutions based on standard data structures. In addition, our results demonstrate that good performance scalability can be achieved by employing the proposed parallel algorithm.
Databáze: OpenAIRE