A Semantic Relatedness Measure Based on Combined Encyclopedic, Ontological and Collocational Knowledge

Authors: Haralambous, Yannis; Klyuev, Vitaly
Year of publication: 2011
Subject:
Source: Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang-Mai, Thailand, November 8-13, 2011
Document type: Working Paper
Description: We describe a new semantic relatedness measure combining the Wikipedia-based Explicit Semantic Analysis (ESA) measure, the WordNet path measure and the mixed collocation index. Our measure achieves the currently highest results on the WS-353 test: a Spearman rho coefficient of 0.79 (vs. 0.75 in Gabrilovich and Markovitch, 2007) when applying the measure directly, and a value of 0.87 (vs. 0.78 in Agirre et al., 2009) when using the prediction of a polynomial SVM classifier trained on our measure. In the appendix we discuss the adaptation of ESA to 2011 Wikipedia data, as well as various unsuccessful attempts to enhance ESA by filtering at word, sentence, and section level.
Comment: 6 pages, 6 figures, accepted for publication at IJCNLP2011 Conference
Database: arXiv
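
Note: The description reports results as Spearman rho coefficients between the measure's scores and human judgments on WS-353. As a minimal sketch of how such a coefficient can be computed, the Python below ranks both score lists (averaging tied ranks) and takes the Pearson correlation of the ranks. The function names and the toy scores are hypothetical and only illustrate the evaluation idea; they are not the authors' code or data, and nothing here reproduces the combined measure itself.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def spearman_rho(x, y):
    """Spearman rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy values (NOT taken from WS-353 or from the paper).
human_scores = [7.5, 1.2, 6.8, 3.1, 9.0]
measure_scores = [0.61, 0.05, 0.70, 0.22, 0.93]
rho = spearman_rho(human_scores, measure_scores)
```

Because rho depends only on rank order, a measure can be compared with human ratings on a different numeric scale, which is presumably why it is the usual statistic for word-pair benchmarks of this kind.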