Exploring the Performance of Resampling Strategies for the Class Imbalance Problem.

Authors: García, Vicente; Sánchez, José Salvador; Mollineda, Ramón A.
Source: Trends in Applied Intelligent Systems; 2010, p541-549, 9p
Abstract: The present paper studies the influence of two distinct factors on the performance of some resampling strategies for handling imbalanced data sets. In particular, we focus on the nature of the classifier used, along with the ratio between minority and majority classes. Experiments using eight different classifiers show that the most significant differences are for data sets with low or moderate imbalance: over-sampling clearly appears as better than under-sampling for local classifiers, whereas some under-sampling strategies outperform over-sampling when employing classifiers with global learning. [ABSTRACT FROM AUTHOR]
Database: Complementary Index