Popis: |
This paper investigates negative sentiment tweets with the presence of hyperboles for sarcasm detection. Six thousand and six hundred pre-processed negative sentiment tweets comprising #Chinesevirus, #Kungflu, #COVID19, #Hantavirus and #Coronavirus were gathered for sarcasm detection. Five hyperbole features, namely interjection, intensifier, capital letter, punctuation mark and elongated word were analysed using three renowned machine learning algorithms, that is, Support Vector Machine, Random Forest, and Random Forest with Bagging. With the presence of hyperbolic words in the tweets in an unbiased dataset, the proposed model with elongated word achieved an accuracy and F-score of 78.74% and 71%, respectively. Intensifier was found to be the most significant hyperbole (p |