Popis: |
Having more samples belonging to one class than the samples of the other class in data used in a classification task is known as class imbalance problem. Handling class imbalance is crucial since the classifier's performance is highly affected. One of the solution approaches of this problem is to make the data balanced by generating synthetic data. Employing resampling methods is a common way of generating synthetic data. Although adversarial generative networks (GANs) are mainly designed to generate image data, they can also be an alternative to solve the class imbalance problem by generating tabular data. This work presents a comparative study of resampling methods with GANs based methods. The performance of machine learning methods improved by 27% if the data is balanced with resampling methods. However, similar performance results were observed with working on imbalanced data if the GANs based methods are employed for synthetic data generation. |