Comparison analysis of Bangla news articles classification using support vector machine and logistic regression

Autor: Md Gulzar Hussain, Babe Sultana, Mahmuda Rahman, Md Rashidul Hasan
Rok vydání: 2023
Předmět:
Popis: In the information age, Bangla news articles on the internet are fast-growing. For organizing, every news site has a particular structure and categorization. News article classification is a method to determine a document’s classification based on various predefined categories. This research discusses the classification of Bangla news articles on the online platform and tries to make constructive comparison using several classification algorithms. For Bangla news articles classification, term frequencyinverse document frequency (TF-IDF) weighting and count vectorizer have been used as a feature extraction process, and two common classifiers named support vector machine (SVM) and logistic regression (LR) employed for classifying the documents. It is clear that the accuracy of the experimental results by applying SVM is 84.0% and LR is 81.0% for twelve categories of news articles. In this research work, when we have made comparison two renowned classification algorithms applied on the Bangla news articles, LR was outperformed by SVM.
Databáze: OpenAIRE