Assessing data mining algorithms to predict the quality of groundwater resources for determining irrigation hazard.

Autor: Masoudi, Reyhaneh, Mousavi, Seyed Roohollah, Rahimabadi, Pouyan Dehghan, Panahi, Mehdi, Rahmani, Asghar
Předmět:
Zdroj: Environmental Monitoring & Assessment; Feb2023, Vol. 195 Issue 2, p1-18, 18p
Abstrakt: This study aims to compare three popular machine learning (ML) algorithms including random forest (RF), boosting regression tree (BRT), and multinomial logistic regression (MnLR) for spatial prediction of groundwater quality classes and mapping it for salinity hazard. Three hundred eighty-six groundwater samples were collected from an agriculturally intensive area in Fars Province, Iran, and nine hydro-chemical parameters were defined and interpreted. Variance inflation factor and Pearson's correlations were used to check collinearity between variables. Thereinafter, the performance of ML models was evaluated by statistical indices, namely, overall accuracy (OA) and Kappa index obtained from the confusion matrix. The results showed that the RF model was more accurate than other models with the slight difference. Moreover, the analysis of relative importance also indicated that sodium adsorption ratio (SAR) and pH have the most impact parameters in explaining groundwater quality classes, respectively. In this research, applied ML algorithms along with the hydro-chemical parameters affecting the quality of ground water can lead to produce spatial distribution maps with high accuracy for managing irrigation practice. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index