A novel multi-stage ensemble model with fuzzy clustering and optimized classifier composition for corporate bankruptcy prediction
Autor: | José Humberto Ablanedo-Rosas, Xin Wu, Wenyu Zhang, Dongqi Yang, Wangzhi Yu, Lingxiao Yang |
---|---|
Rok vydání: | 2021 |
Předmět: |
Statistics and Probability
0209 industrial biotechnology Fuzzy clustering Ensemble forecasting business.industry Computer science General Engineering Pattern recognition 02 engineering and technology Multi stage ComputingMethodologies_PATTERNRECOGNITION 020901 industrial engineering & automation Artificial Intelligence 0202 electrical engineering electronic engineering information engineering Bankruptcy prediction 020201 artificial intelligence & image processing Artificial intelligence business Classifier (UML) |
Zdroj: | Journal of Intelligent & Fuzzy Systems. 40:4169-4185 |
ISSN: | 1875-8967 1064-1246 |
Popis: | With the rapid development of commercial credit mechanisms, credit funds have become fundamental in promoting the development of manufacturing corporations. However, large-scale, imbalanced credit application information poses a challenge to accurate bankruptcy predictions. A novel multi-stage ensemble model with fuzzy clustering and optimized classifier composition is proposed herein by combining the fuzzy clustering-based classifier selection method, the random subspace (RS)-based classifier composition method, and the genetic algorithm (GA)-based classifier compositional optimization method to achieve accuracy in predicting bankruptcy among corporates. To overcome the inherent inflexibility of traditional hard clustering methods, a new fuzzy clustering-based classifier selection method is proposed based on the mini-batch k-means algorithm to obtain the best performing base classifiers for generating classifier compositions. The RS-based classifier composition method was applied to enhance the robustness of candidate classifier compositions by randomly selecting several subspaces in the original feature space. The GA-based classifier compositional optimization method was applied to optimize the parameters of the promising classifier composition through the iterative mechanism of the GA. Finally, six datasets collected from the real world were tested with four evaluation indicators to assess the performance of the proposed model. The experimental results showed that the proposed model outperformed the benchmark models with higher predictive accuracy and efficiency. |
Databáze: | OpenAIRE |
Externí odkaz: |