Twitter Sentiment Analysis Using Different Machine Learning and Feature Extraction Techniques
Autor: | Zainab Namh Sultani, Mohammad W. Habib |
---|---|
Rok vydání: | 2021 |
Předmět: |
Computer science
business.industry Feature vector Sentiment analysis Feature extraction General Engineering Logistic regression Machine learning computer.software_genre Support vector machine Naive Bayes classifier Preprocessor Artificial intelligence InformationSystems_MISCELLANEOUS Dimension (data warehouse) business computer |
Zdroj: | Al-Nahrain Journal of Science. 24:50-54 |
ISSN: | 2663-5461 2663-5453 |
DOI: | 10.22401/anjs.24.3.08 |
Popis: | Twitter is considered a significant source of exchanging information and opinion in today's business. Analysis of this data is critical and complex due to the size of the dataset. Sentiment Analysis is adopted to understand and analyze the sentiment of such data. In this paper, a Machine learning approach is employed for analyzing the data into positive or negative sentiment (opinion). Different arrangements of preprocessing techniques are applied to clean the tweets, and various feature extraction methods are used to extract and reduce the dimension of the tweets' feature vector. Sentiment140 dataset is used, and it consists of sentiment labels and tweets, so supervised machine learning models are used, specifically Logistic Regression, Naive Bayes, and Support Vector Machine. According to the experimental results, Logistic Regression was the best amongst other models with all feature extraction techniques. |
Databáze: | OpenAIRE |
Externí odkaz: |