Albanian News Category Predictor System using a Multinomial Naïve Bayes and Logistic Regression Algorithms

Autor: Lamir Shkurti, Faton Kabashi
Rok vydání: 2021
Předmět:
Zdroj: 2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT).
DOI: 10.1109/ismsit52890.2021.9604602
Popis: In this paper, we proposed a system for the classification of different news in the Albanian language using the Multinomial Naive Bayes and Logistic Regression algorithms. The system is web-based and developed with Python programing language and Flask web framework. For the classification of Albanian language news, we have used 70% of the dataset for training and 30% for tests. The result shows that our system can use for the classification of Albanian language news. The proposed Multinomial Naive Bayes and Logistic Regression classifiers were analyzed and compared for the following evaluation parameters: Accuracy, Precision, Recall, F1-score, and Confusion Matrix. The obtained results showed that the Logistic Regression algorithm has higher accuracy compared to the Naive Bayes Multinomial algorithm in the classification of news in the Albanian language.
Databáze: OpenAIRE