Strength and similarity of affix removal stemming algorithms

Autor: William B. Frakes, Christopher J. Fox
Rok vydání: 2003
Předmět:
Zdroj: ACM SIGIR Forum. 37:26-30
ISSN: 0163-5840
DOI: 10.1145/945546.945548
Popis: This study evaluated the strength of, and similarity among, four affix removal stemming algorithms. Strength and similarity were evaluated in different ways, including new metrics based on the Hamming distance measure. Data was collected on stemmer outputs for a list of 49,656 English words derived from the UNIX spelling dictionary and the Moby corpus. Conclusions about the relative strength and similarity of the four stemming algorithms are reported.
Databáze: OpenAIRE