A novel approach for solving skewed classification problem using cluster based ensemble method

Autor: Amit Kumar Tyagi, V. Krishna Reddy, Gillala Rekha
Rok vydání: 2020
Předmět:
Zdroj: Mathematical Foundations of Computing. 3:1-9
ISSN: 2577-8838
DOI: 10.3934/mfc.2020001
Popis: In numerous real-world applications, the class imbalance problem is prevalent. When training samples of one class immensely outnumber samples of the other classes, the traditional machine learning algorithms show bias towards the majority class (a class with more number of samples) lead to significant losses of model performance. Several techniques have been proposed to handle the problem of class imbalance, including data sampling and boosting. In this paper, we present a cluster-based oversampling with boosting algorithm (Cluster+Boost) for learning from imbalanced data. We evaluate the performance of the proposed approach with state-of-the-art methods based on ensemble learning like AdaBoost, RUSBoost and SMOTEBoost. We conducted experiments on 22 data sets with various imbalance ratios. The experimental results are promising and provide an alternative approach for improving the performance of the classifier when learned on highly imbalanced data sets.
Databáze: OpenAIRE
Nepřihlášeným uživatelům se plný text nezobrazuje