Abstrakt: |
The "Daijiworld News" forum, a well-known news website in coastal Karnataka, was the source of the comments for this paper's sentiment analysis study, which was done on about 15,000 reader comments. The comments were scraped using Beautiful Soup, a popular web scraping library and labelled as positive, negative, and neutral. Pre-processing of comments was made using techniques such as stop word removal, tokenization, stemming, lemmatization, and lowercase conversion. Logistic regression, support vector machine (SVM), naive Bayes, random forest, K-nearest neighbors (KNN), AdaBoost, gradient boosting and neural networks was used for classification. Performance metrics including accuracy, precision, recall, and F1 score were evaluated. Logistic regression achieved the highest precision (0.75), recall (0.74), accuracy (0.74), and F1 score (0.74), followed closely by the neural network classifier with a precision of 0.670, recall of 0.670, accuracy of 0.670, and F1 score of 0.669. The study demonstrates the effectiveness of logistic regression and neural networks in sentiment analysis of news forum comments, giving insightful information to grasp public opinion and improving user engagement. The findings contribute to the field of sentiment analysis, emphasising the significance of web scraping and pre-processing techniques in enhancing sentiment classification accuracy. The results serve as a reference for researchers and practitioners, assisting in the selection of appropriate classifiers for sentiment analysis in similar contexts. The study encourages further exploration of advanced techniques to enhance sentiment classification accuracy in regional news forums, paving the way for future research in sentiment analysis. [ABSTRACT FROM AUTHOR] |