Lexicon-based emotion analysis in Turkish
Autor: | Adil Alpkocak, Mansur Alp Toçoğlu |
---|---|
Rok vydání: | 2019 |
Předmět: |
General Computer Science
business.industry Computer science Turkish Lemmatisation media_common.quotation_subject Bigram 020206 networking & telecommunications 02 engineering and technology computer.software_genre Lexicon language.human_language Term (time) Weighting Sadness Turkish emotion lexicon emotion extraction lexicon-based emotion analysis key word-spotting Turkish emotion analysis 0202 electrical engineering electronic engineering information engineering Selection (linguistics) language Artificial intelligence Electrical and Electronic Engineering business computer Natural language processing media_common |
Zdroj: | Volume: 27, Issue: 2 1213-1227 Turkish Journal of Electrical Engineering and Computer Science |
ISSN: | 1300-0632 1303-6203 |
Popis: | In this paper, we proposed a lexicon for emotion analysis in Turkish for six emotional categories happiness, fear, anger, sadness, disgust, and surprise. Besides, we also investigated the effects of a lemmatizer and a stemmer, two term-weighting schemes, four lexicon enrichment methods, and a term selection approach for lexicon construction. To do this, we generated Turkish emotion lexicon based on a dataset, TREMO, containing 25,989 documents. We then preprocessed the documents to obtain dictionary and stem forms of each term using a lemmatizer and a stemmer. Afterwards, we proposed two different weighting schemes where term frequency, term-class frequency and mutual information (MI) values for six emotion categories are taken into consideration. We then enriched the lexicon by using bigram and concept hierarchy methods, and performed term selection for efficiency issues. Then, we compared the performance of lexicon-based approach with machine learning based approach by using our proposed lexicon. The experiments showed that the use of the proposed lexicon efficiently produces comparable results in emotion analysis in Turkish text. |
Databáze: | OpenAIRE |
Externí odkaz: |