Forecasting Loan Default in Europe with Machine Learning
Autor: | Luca Barbaglia, Sebastiano Manzan, Elisa Tosetti |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
Economics and Econometrics
Boosting (machine learning) Computer science media_common.quotation_subject Big data Settore SECS-P/05 - Econometria Machine learning computer.software_genre Logistic regression Carry (investment) Benchmark (surveying) 0502 economics and business 050207 economics Set (psychology) media_common 050208 finance business.industry 05 social sciences Interest rate Loan Settore SECS-S/03 - Statistica Economica Default Artificial intelligence business Risk assessment computer Finance Credit risk |
Popis: | We use a large data set of over 12 million residential mortgages observed over time to investigate the loan default behavior in several European countries. We model the occurrence of default as a function of borrower characteristics, loan-specific variables, and a set of local economic conditions. Given the high geographical heterogeneity in default and its drivers, we carry out the analysis at the regional level. We adopt boosting algorithms from the machine learning literature and compare their performance relative to the logistic regression. With respect to the logistic benchmark, boosting models perform significantly better in providing predictions. The most important variables in explaining loan default are the interest rate currently applied to the mortgage and the local economic characteristics, while other loan- or borrower-specific features are less relevant. Our results indicate the existence of relevant geographical heterogeneity in the importance of the variables, pointing at the need for regionally tailored risk assessment and policies in Europe. |
Databáze: | OpenAIRE |
Externí odkaz: |