Japanese query alteration based on semantic similarity
Autor: | Hisami Suzuki, Masato Hagiwara |
---|---|
Rok vydání: | 2009 |
Předmět: |
Information retrieval
Web search query Computer science business.industry Query optimization computer.software_genre Task (computing) Query expansion Semantic similarity Similarity (network science) Kernel (image processing) Web query classification Artificial intelligence business computer Natural language processing |
Zdroj: | Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics on - NAACL '09. |
Popis: | We propose a unified approach to web search query alterations in Japanese that is not limited to particular character types or orthographic similarity between a query and its alteration candidate. Our model is based on previous work on English query correction, but makes some crucial improvements: (1) we augment the query-candidate list to include orthographically dissimilar but semantically similar pairs; and (2) we use kernel-based lexical semantic similarity to avoid the problem of data sparseness in computing query-candidate similarity. We also propose an efficient method for generating query-candidate pairs for model training and testing. We show that the proposed method achieves about 80% accuracy on the query alteration task, improving over previously proposed methods that use semantic similarity. |
Databáze: | OpenAIRE |
Externí odkaz: |