Customer Personality Analysis for Churn Prediction Using Hybrid Ensemble Models and Class Balancing Techniques

Autor: Noman Ahmad, Mazhar Javed Awan, Haitham Nobanee, Azlan Mohd Zain, Ansar Naseem, Amena Mahmoud
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: IEEE Access, Vol 12, Pp 1865-1879 (2024)
Druh dokumentu: article
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2023.3334641
Popis: Today’s businesses rely heavily on focused marketing to improve their chances of growing and keeping their consumer base. Internet behemoths like Google and Facebook have expanded their business models around targeted advertisements that support business growth. Customer personality identification helps for churn prediction for companies. This problem arises in several companies where customer leaves companies for many reasons. This gap leads to conduct study for customer personality analysis. The collected dataset was highly imbalanced in nature. Two class balancing approaches CTGAN (Conditional tabular Generative adversarial networks) and SMOTE (Synthetic minority oversampling technique) has been utilized to equalize the both classes. There are three ensemble approaches such as bagging, boosting and stacking have been utilized for modeling purpose bagging approach uses Random Forest (RF) boosting utilizes XGBoost (XGB), Light Gradient Boosting Machine (LGBM) and ADA Boost (ADA B). The proposed Hybrid Model HSLR comprises of RF, XGB, ADA Boost, LGBM approaches as base classifiers and LR as a Meta classifier. Three testing independent set, k-fold with 5 and 10 folds have been utilized. To evaluate the performance of classifiers evaluation metrics such as Accuracy score, Precision, Recall, F1 score, MCC and ROC score have been utilized. The SMOTE generated data has shown results as compare with CTGAN generated data. The SMOTE approach has shown the highest results of 94.06, 94.23, 94.28, 94.05, 88.13 and 0.984 as accuracy score, Precision, recall, F1, MCC and Roc score respectively.
Databáze: Directory of Open Access Journals