Arabic stemming without a root dictionary
Authors: | Kazem Taghva, Jeffrey Coombs, R. Elkhoury |
---|---|
Year of publication: | 2005 |
Subject: |
Root (linguistics)
Information retrieval, Computer science, Arabic, Information technology, Information analysis, Information science, Pattern matching, Artificial intelligence, Document retrieval, Natural language, Natural language processing |
Source: | ITCC (1) |
DOI: | 10.1109/itcc.2005.90 |
Description: | We have implemented a root-extraction stemmer for Arabic which is similar to the Khoja stemmer but without a root dictionary. Our stemmer was found to perform equivalently to the Khoja stemmer as well as so-called "light" stemmers in monolingual document retrieval tasks performed on the Arabic TREC-2001 collection. A root dictionary, therefore, does not improve Arabic monolingual document retrieval. |
Database: | OpenAIRE |