An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices

Autor: Mahmood Yousefi, Vahide Oskoei, Hamid Reza Esmaeli, Mansour Baziar
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Results in Engineering, Vol 24, Iss , Pp 103534- (2024)
Druh dokumentu: article
ISSN: 2590-1230
DOI: 10.1016/j.rineng.2024.103534
Popis: The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorption ratio (SAR) using data-driven techniques. It explores the AdaBoost algorithm and a hybrid model that combines AdaBoost with Extra Trees (ET). The research centers on the Sarayan region in southern Khorasan, Iran, involving the collection and analysis of groundwater quality data, which includes both physical and chemical parameters, spanning a period of four years. Pearson correlation analysis is utilized in this study to determine the critical input variables for predicting Sodium Adsorption Ratio (SAR) and Potential Salinity (PS). Significant performance improvements are demonstrated through the optimization of AdaBoost and the hybrid AdaBoost-Extra Trees (ET) models, achieved via grid search and Bayesian optimization. The results indicate that the hybrid model surpasses AdaBoost in performance. Specifically, the optimized AdaBoost model achieves a Test MSE of 3.57 and a Test R² of 0.87 for SAR prediction, whereas the hybrid model obtains a significantly better Test MSE of 0.87 and Test R² of 0.97. For the prediction of PS, the optimized AdaBoost model yields a Test MSE of 0.223 and a Test R² of 0.976, while the hybrid model delivers a significantly improved Test MSE of 0.018 and Test R² of 0.998. Feature importance analysis identifies critical patterns in the relevance of input parameters. This study underscores the efficacy of hybrid data-driven models in water quality prediction, showcasing their superior accuracy.
Databáze: Directory of Open Access Journals