Zobrazeno 1 - 10
of 113
pro vyhledávání: '"Marina Barsky"'
Publikováno v:
Proceedings of the VLDB Endowment. 13:2787-2800
We investigate the problem of building a suffix array substring index for inputs significantly larger than main memory. This problem is especially important in the context of biological sequence analysis, where biological polymers can be thought of a
Publikováno v:
Viruses, Vol 2, Iss 9, Pp 1867-1885 (2010)
We describe the use of Java Pattern Finder (JaPaFi) to identify short (
Externí odkaz:
https://doaj.org/article/87d0888ce7804861974ad1594379e395
Nowadays, textual databases are among the most rapidly growing collections of data. Some of these collections contain a new type of data that differs from classical numerical or textual data. These are long sequences of symbols, not divided into well
Publikováno v:
Proceedings of the VLDB Endowment. 9:13-23
Studying the topology of a network is critical to inferring underlying dynamics such as tolerance to failure, group behavior and spreading patterns. k -core decomposition is a well-established metric which partitions a graph into layers from external
Publikováno v:
Full-Text (Substring) Indexes in External Memory
Nowadays, textual databases are among the most rapidly growing collections of data. Some of these collections contain a new type of data that differs from classical numerical or textual data. These are long sequences of symbols, not divided into well
Publikováno v:
Scopus-Elsevier
In this paper we introduce a new type of pattern -- a flipping correlation pattern. The flipping patterns are obtained from contrasting the correlations between items at different levels of abstraction. They represent surprising correlations, both po
Publikováno v:
Information Systems. 36:644-654
A suffix tree is a fundamental data structure for string searching algorithms. Unfortunately, when it comes to the use of suffix trees in real-life applications, the current methods for constructing suffix trees do not scale for large inputs. As suff
Publikováno v:
ACM Journal of Experimental Algorithmics. 12:1-26
We present a novel graph model and an efficient algorithm for solving the “threshold all against all” problem, which involves searching two strings (with length M and N , respectively) for all maximal approximate substring matches of length at le
Publikováno v:
BMC Research Notes
Background Large DNA sequence data sets require special bioinformatics tools to search and compare them. Such tools should be easy to use so that the data can be easily accessed by a wide array of researchers. In the past, the use of suffix trees for
Nowadays, textual databases are among the most rapidly growing collections of data. Some of these collections contain a new type of data that differs from classical numerical or textual data. These are long sequences of symbols, not divided into well