Approximate phrase searching: Movie scripts and song lyrics

Autor: Kathryn Patterson, Carolyn Watters
Rok vydání: 2008
Předmět:
Zdroj: ASIST
ISSN: 0044-7870
DOI: 10.1002/meet.2008.1450450271
Popis: Search engines provide an effective means of retrieving a document in which a piece of text occurs when the query contains infrequently occurring terms or the query is known to be an exact phrase. However, phrase queries usually contain common terms including determiners and users may not remember phrases exactly. Search engines discard common terms or assign them little importance, which may lead to poor retrieval results. In this paper, we examine the use of proximity-based phrase searching to search for quotes from song lyrics and movie scripts and compare the results against Google.ca, Yahoo.com and Ask.com. An improvement of over 25% on search engine results shows that an additional search method to complement the common search engine methods would be beneficial for this task.
Databáze: OpenAIRE