Exploiting Machine Learning for Comparative Sentences Extraction

Autor: Wei Wang, Yong Dong Xu, Tie Jun Zhao, Guo Dong Xin
Rok vydání: 2015
Předmět:
Zdroj: International Journal of Hybrid Information Technology. 8:347-354
ISSN: 1738-9968
DOI: 10.14257/ijhit.2015.8.3.31
Popis: This paper studies the problem of extracting Chinese comparative sentences from user reviews, which is a problem of text classification in the level of sentence. This paper first deals with the class skewed problem of review data, and then builds a SVM (support vector machine) model to classify comparative and non-comparative sentences into different groups on a balanced dataset. Various linguistic and statistical features are introduced to characterize a sentence. Experiments were conducted on user-generated product reviews. As a result, our experiments show significant performance, an overall Fscore of 85.87%.
Databáze: OpenAIRE