Ensemble learning landslide susceptibility assessment with optimized non-landslide samples selection

Autor: Jiangang Lu, Yi He, Lifeng Zhang, Qing Zhang, Binghai Gao, Hesheng Chen, Yumin Fang
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Geomatics, Natural Hazards & Risk, Vol 15, Iss 1 (2024)
Druh dokumentu: article
ISSN: 19475705
1947-5713
1947-5705
DOI: 10.1080/19475705.2024.2378176
Popis: Non-landslide samples influence the outcomes of landslide susceptibility assessment. Existing studies did not fully consider the equilibrium between landslide and non-landslide samples in similar environments, resulting in poor reliability of landslide susceptibility assessment. This study proposed a non-landslide samples optimization method with a constraint of disaster-pregnant environment similarity to construct the balanced landslide and non-landslide samples. We employed the heterogeneous stacking and blending ensemble learning models to generate the landslide susceptibility assessment. This study focused on the Bailong River Basin with complex environment and frequent landslides as the study area. First, we extracted 12 landslide influencing factors based on multiple sources and analyzed the spatial distribution patterns of landslides. Second, we constructed similar environments based on the assessment units obtained from the curvature watershed method and selected an equal amount of both landslide and non-landslide samples in every different environment. Finally, three classic neural network models, namely multilayer perceptron, convolutional neural network, and gated recurrent unit were used as base models for stacking and blending ensemble learning models to assess landslide susceptibility. The findings suggested that the landslide susceptibility assessment results with optimized non-landslide samples were more reliable, especially the proposed method improved the prediction results in sample-sparse regions. The stacking ensemble constructed in this study demonstrated the highest area under the curve of 0.88 for the testing dataset, outperforming the blending ensemble and three base models. The issue of unreliable landslide susceptibility assessment results in sample-sparse regions within complex environments can be effectively addressed by considering the balanced sampling of non-landslide samples under the constraint of similar environments.
Databáze: Directory of Open Access Journals