Insights into the prediction uncertainty of machine-learning-based digital soil mapping through a local attribution approach

Autor: J. Rohmer, S. Belbeze, D. Guyonnet
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: SOIL, Vol 10, Pp 679-697 (2024)
Druh dokumentu: article
ISSN: 2199-3971
2199-398X
DOI: 10.5194/soil-10-679-2024
Popis: Machine learning (ML) models have become key ingredients for digital soil mapping. To improve the interpretability of their predictions, diagnostic tools such as the widely used local attribution approach known as SHapley Additive exPlanations (SHAP) have been developed. However, the analysis of ML model predictions is only one part of the problem, and there is an interest in obtaining deeper insights into the drivers of the prediction uncertainty as well, i.e. explaining why an ML model is confident given the set of chosen covariate values in addition to why the ML model delivered some particular results. In this study, we show how to apply SHAP to local prediction uncertainty estimates for a case of urban soil pollution – namely, the presence of petroleum hydrocarbons in soil in Toulouse (France), which pose a health risk via vapour intrusion into buildings, direct soil ingestion, and groundwater contamination. Our results show that the drivers of the prediction best estimates are not necessarily the drivers of confidence in these predictions, and we identify those leading to a reduction in uncertainty. Our study suggests that decisions regarding data collection and covariate characterisation as well as communication of the results should be made accordingly.
Databáze: Directory of Open Access Journals