Twitter Sentiment Analysis Using Different Machine Learning and Feature Extraction Techniques

Autor: Zainab Namh Sultani, Mohammad W. Habib
Rok vydání: 2021
Předmět:
Zdroj: Al-Nahrain Journal of Science. 24:50-54
ISSN: 2663-5461
2663-5453
DOI: 10.22401/anjs.24.3.08
Popis: Twitter is considered a significant source of exchanging information and opinion in today's business. Analysis of this data is critical and complex due to the size of the dataset. Sentiment Analysis is adopted to understand and analyze the sentiment of such data. In this paper, a Machine learning approach is employed for analyzing the data into positive or negative sentiment (opinion). Different arrangements of preprocessing techniques are applied to clean the tweets, and various feature extraction methods are used to extract and reduce the dimension of the tweets' feature vector. Sentiment140 dataset is used, and it consists of sentiment labels and tweets, so supervised machine learning models are used, specifically Logistic Regression, Naive Bayes, and Support Vector Machine. According to the experimental results, Logistic Regression was the best amongst other models with all feature extraction techniques.
Databáze: OpenAIRE