Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries
Authors: | Michal Batko, Filip Nalepa, Pavel Zezula |
---|---|
Year of publication: | 2017 |
Subject: | Computer science; Nearest neighbor search; Context (language use); 02 engineering and technology; Parallel computing; computer.software_genre; Stream processing; 020204 information systems; 0202 electrical engineering, electronic engineering, information engineering; 020201 artificial intelligence & image processing; Double-ended priority queue; Data mining; Cache; Priority queue; Throughput (business); computer; Cache algorithms |
Source: | Similarity Search and Applications (SISAP), ISBN: 9783319684734 |
DOI: | 10.1007/978-3-319-68474-1_2 |
Description: | Content-based similarity search techniques have been employed in a variety of today's applications. In our work, we address the scenario in which similarity search is applied in the context of stream processing. In particular, there is a stream of query objects that need to be evaluated. Our goal is to cope with the rate of incoming query objects (i.e., to reach sufficient throughput) and, at the same time, to keep the quality of the obtained results high. We propose an approximation technique for similarity search that combines the probability that an indexed object is part of a query result with the time needed to examine the object. We are able to achieve a better trade-off between the efficiency (processing time) and the quality (precision) of the similarity search than traditional priority-queue-based approximation techniques. |
Database: | OpenAIRE |
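
The description above mentions combining the probability that an indexed object belongs to a query result with the time needed to examine it. The record does not give the paper's actual algorithm, so the following is only a minimal sketch of that general idea, under stated assumptions: data is grouped into partitions, each with an estimated relevance and an access cost that is lower when the partition is cached, and candidates are ranked by relevance per unit of cost within a time budget. The names `Partition`, `plan`, `relevance`, `cost_disk`, `cost_cache`, and `budget` are all hypothetical.

```python
import heapq
from dataclasses import dataclass


@dataclass
class Partition:
    name: str
    relevance: float   # assumed estimate of the chance the partition holds result objects
    cost_disk: float   # assumed time to examine the partition when read from disk
    cost_cache: float  # assumed time to examine the partition when it is cached


def plan(partitions, cached, budget):
    """Pick partitions to examine, best relevance-per-cost first, within a time budget."""
    heap = []
    for p in partitions:
        cost = p.cost_cache if p.name in cached else p.cost_disk
        # heapq is a min-heap, so the score is negated to pop the best candidate first.
        heapq.heappush(heap, (-p.relevance / cost, p.name, cost))

    order, spent = [], 0.0
    while heap:
        _, name, cost = heapq.heappop(heap)
        if spent + cost > budget:
            continue  # does not fit in the remaining budget; a cheaper one still might
        order.append(name)
        spent += cost
    return order, spent


parts = [
    Partition("A", relevance=0.5, cost_disk=10.0, cost_cache=1.0),
    Partition("B", relevance=0.4, cost_disk=10.0, cost_cache=1.0),
    Partition("C", relevance=0.1, cost_disk=10.0, cost_cache=1.0),
]
order, spent = plan(parts, cached={"B"}, budget=11.0)
print(order, spent)
```

In this toy input, a queue ordered by relevance alone would examine A before B. Because B is assumed to be cached and therefore cheap, the cost-aware ordering examines it first.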