Textual Feature Ensemble-Based Sarcasm Detection in Twitter Data

Autor: Anandhakumar Palanisamy, J. Vijay Saravana, Karthik Sundararajan
Rok vydání: 2020
Předmět:
Zdroj: Intelligence in Big Data Technologies—Beyond the Hype ISBN: 9789811552847
Popis: Sarcasm is one of the most popular ways of expressing one’s opinion on a specific subject. Sarcasm is an act of conveying a viewpoint with an opposite emotion. Identifying such statements among the textual information is an important challenge present in sentiment analysis. Often, sarcastic statements are misleading. They mislead the whole context of the sentence; hence, detecting sarcasm is important in order to understand the actual meaning involved. The main objective of this paper is to propose a feature ensemble-based learning algorithm for sarcasm detection. Most of the existing works focused on using lexical features for identifying sarcasm. In the proposed work apart from the traditional lexical features, hyperbolic and pragmatic features have been extracted. After extracting the features, a model for detecting sarcasm has been proposed based on stacking-based feature ensemble algorithm. Experimental results show that the feature-based approaches when considered individually, Internet slang-based features attain an accuracy of 62%; interjection features obtained an accuracy of 49%, and emoticon-based features attained 59%, respectively. However, the proposed ensemble learning algorithm attains an overall accuracy of 83%, which is much better than the feature set-based approaches.
Databáze: OpenAIRE