Assessing the effect of ensemble learning algorithms and validation approach on estimating forest aboveground biomass: a case study of natural secondary forest in Northeast China

Autor: Hungil Jin, Yinghui Zhao, Unil Pak, Zhen Zhen, Kumryong So
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Geo-spatial Information Science, Pp 1-20 (2024)
Druh dokumentu: article
ISSN: 10095020
1993-5153
1009-5020
DOI: 10.1080/10095020.2024.2311261
Popis: Accurate estimation of forest aboveground biomass is essential for the assessment of regional carbon cycle and the climate change in the terrestrial ecosystem. Currently, ensemble learning algorithms and cross-validation methods have been widely applied to estimate regional forest Above Ground Biomass (AGB). However, the effects of ensemble learning algorithms, validation methods, and their interactions on forest AGB estimation were rarely investigated. Based on Landsat 8 Operational Land Imager (OLI) imagery, Airborne Laser Scanning (ALS) data and China’s National Forest Continuous Inventory data, this study explored the effects of five ensemble learning algorithms, including Simple Averaging (SA), Weighted Averaging (WA), Stacked Generalization (SG), Random Forest (RF) and Extreme Gradient Boosting (XGBoost), and two validation approaches (i.e. 10-fold and leave-one-out cross-validation) on the AGB estimation of the Natural Secondary Forests (NSFs) in northeast China. The results revealed that the ensemble learning algorithms that combine heterogenous-based models (i.e. SA, WA, SG) generally produced higher accuracy than the base models (i.e. Decision Tree (DT), K-Nearest Neighbor (KNN), Support Vector Regression (SVR), Convolutional Neural Network (CNN)). Among all ensemble learning algorithms, the SG algorithm has the highest accuracy whereas the XGBoost algorithm has the lowest accuracy. Although prediction models considerably impact the accuracy of AGB estimation, the validation approach also plays a non-negligible role in AGB estimation. The leave-one-out cross-validation produced much higher accuracy than the 10-fold cross-validation using the same prediction model and tends to generate over-optimistic AGB estimates compared to 10-fold cross-validation, especially for the averaging and stacking ensemble learning algorithms (i.e. SA, WA, SG). This study highlights the potential challenges of applying a leave-one-out cross-validation approach and provides a scientific foundation for the feasibility of different ensemble learning algorithms and cross-validation approaches for accurate AGB estimation.
Databáze: Directory of Open Access Journals