Evaluation of traditional and machine learning approaches for modeling volatile fatty acid concentrations in anaerobic digestion of sludge: potential and challenges.

Autor: Abubakar UA; Department of Water Resources and Environmental Engineering, Ahmadu Bello University, Zaria, 1045, Kaduna, Nigeria., Lemar GS; Department of Botany, Faculty of Biology, Kabul University, Kart-E-Char, Kabul, Afghanistan., Bello AD; Department of Water Resources and Environmental Engineering, Ahmadu Bello University, Zaria, 1045, Kaduna, Nigeria., Ishaq A; Department of Water Resources and Environmental Engineering, Ahmadu Bello University, Zaria, 1045, Kaduna, Nigeria., Dandajeh AA; Department of Water Resources and Environmental Engineering, Ahmadu Bello University, Zaria, 1045, Kaduna, Nigeria., Jagun ZT; School of Built Environment Engineering and Computing, Leeds Beckett University City Campus, Leeds, UK. z.t.jagun@leedsbeckett.ac.uk., Houmsi MR; New Era and Development in Civil Engineering Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, Nasiriyah, 64001, Iraq.
Jazyk: angličtina
Zdroj: Environmental science and pollution research international [Environ Sci Pollut Res Int] 2024 Apr 23. Date of Electronic Publication: 2024 Apr 23.
DOI: 10.1007/s11356-024-33281-2
Abstrakt: This study evaluates models for predicting volatile fatty acid (VFA) concentrations in sludge processing, ranging from classical statistical methods (Gaussian and Surge) to diverse machine learning algorithms (MLAs) such as Decision Tree, XGBoost, CatBoost, LightGBM, Multiple linear regression (MLR), Support vector regression (SVR), AdaBoost, and GradientBoosting. Anaerobic bio-methane potential tests were carried out using domestic wastewater treatment primary and secondary sludge. The tests were monitored over 40 days for variations in pH and VFA concentrations under different experimental conditions. The data observed was compared to predictions from the Gaussian and Surge models, and the MLAs. Based on correlation analysis using basic statistics and regression, the Gaussian model appears to be a consistent performer, with high R 2 values and low RMSE, favoring precision in forecasting VFA concentrations. The Surge model, on the other hand, albeit having a high R 2 , has high prediction errors, especially in dynamic VFA concentration settings. Among the MLAs, Decision Tree and XGBoost excel at predicting complicated patterns, albeit with overfitting issues. This study provides insights underlining the need for context-specific considerations when selecting models for accurate VFA forecasts. Real-time data monitoring and collaborative data sharing are required to improve the reliability of VFA prediction models in AD processes, opening the way for breakthroughs in environmental sustainability and bioprocessing applications.
(© 2024. The Author(s).)
Databáze: MEDLINE