A machine learning approach for predicting bank credit worthiness
Autor: | Gideon Evans Wenya, Regina Esi Turkson, Edward Yeallakuor Baagyere |
---|---|
Rok vydání: | 2016 |
Předmět: |
business.industry
Computer science Big data 02 engineering and technology Machine learning computer.software_genre Range (mathematics) Bank credit 020204 information systems 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business computer |
Zdroj: | 2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR). |
DOI: | 10.1109/icaipr.2016.7585216 |
Popis: | Machine learning is an emerging technique for building analytic models for machines to "learn" from data and be able to do predictive analysis. The ability of machines to "learn" and do predictive analysis is very important in this era of big data and it has a wide range of application areas. For instance, banks and financial institutions are sometimes faced with the challenge of what risk factors to consider when advancing credit/loans to customers. For several features/attributes of the customers are normally taken into consideration, but most of these features have little predictive e ect on the credit worthiness or otherwise of the customer. Furthermore, a robust and e ective automated bank credit risk score that can aid in the prediction of customer credit worthiness very accurately is still a major challenge facing many banks. In this paper, we examine a real bank credit data and conduct several machine learning algorithms on the data for comparative analysis and to choose which algorithms are the best fit for learning bank credit data. The algorithms gave over 80% accuracy in prediction. Furthermore, the most important features that determine whether a customer will default or otherwise in paying his/her credit the next month are extracted from a total of 23 features. We then applied these most important features on some selected machine learning algorithms and compare their predictive accuracy with the other algorithms that used all the 23 features. The results show no significant di erence, signifying that these features can accurately determine the credit worthiness of the customers. Finally, we formulate a predictive model using the most important features to predict the credit worthiness of a given customer. |
Databáze: | OpenAIRE |
Externí odkaz: |