Sentiment Classification for Film Reviews in Gujarati Text Using Machine Learning and Sentiment Lexicons

Autor: Parita Shah, Priya Swaminarayan, Maitri Patel
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: Journal of ICT Research and Applications, Vol 17, Iss 1 (2023)
Druh dokumentu: article
ISSN: 2337-5787
2338-5499
DOI: 10.5614/itbj.ict.res.appl.2023.17.1.1
Popis: In this paper, two techniques for sentiment classification are proposed: Gujarati Lexicon Sentiment Analysis (GLSA) and Gujarati Machine Learning Sentiment Analysis (GMLSA) for sentiment classification of Gujarati text film reviews. Five different datasets were produced to validate the machine learning-based and lexicon-based methods’ accuracy. The lexicon-based approach employs a sentiment lexicon known as GujSentiWordNet, which identifies sentiments with a sentiment score for feature generation, while in the machine learning-based approach, five classifiers are used: logistic regression (LR), random forest (RF), k-nearest neighbors (KNN), support vector machine (SVM), naive Bayes (NB) with TF-IDF, and count vectorizer for feature selection. Experiments were carried out and the results obtained were compared using accuracy, precision, recall, and F-score as performance evaluation criteria. According to the test results, the machine learning-based technique improved accuracy by 3 to 10% on average when compared to the lexicon-based approach.
Databáze: Directory of Open Access Journals