Using Chinese part-of-speech patterns for sentiment phrase identification and opinion extraction in user generated reviews

Autor: Chia-Chun Shih, Ting-Chun Peng
Rok vydání: 2010
Předmět:
Zdroj: ICDIM
DOI: 10.1109/icdim.2010.5664631
Popis: Accelerated growth of the Internet has enabled users worldwide to share their feelings and experiences. User-generated content (UGC) websites are the most abundant sources of user reviews. Accurately identifying sentiment phrases is essential to understand the expressed opinions in user reviews. To achieve this, part-of-speech (POS) patterns of phrases are useful. However, previous studies for Chinese opinion extraction only translate English POS patterns directly into Chinese for this task without considering the feasibility. Therefore, this work proposes a Chinese opinion extraction method that exploits the observed Sinica Treebank POS patterns for sentiment phrase identification. Sinica Treebank is a widely representative POS corpus for Chinese. The results of preliminary experiments indicate that the proposed method is highly effective in extracting opinions from Chinese UGC reviews.
Databáze: OpenAIRE