Application of Machine Learning Classification Algorithm to Precipitation-induced Landslides Forecasting

Autor: Liu Haizhi, Xu Hui, Bao Hongjun, Xu Wei, Yan Xufeng, Lu Heng, Xu Chengpeng
Jazyk: English<br />Chinese
Rok vydání: 2022
Předmět:
Zdroj: 应用气象学报, Vol 33, Iss 3, Pp 282-292 (2022)
Druh dokumentu: article
ISSN: 1001-7313
DOI: 10.11898/1001-7313.20220303
Popis: To address the practical needs of objectively describing the uncertainty of rainfall-based landslides and the existing problems of single warning indicators and subjective forecasting methods in the meteorological disaster early warning business, landslide disaster data from 2014 to 2020 and multi-source used precipitation analysis data are investigated to construct a regional rainfall-induced landslides probability forecasting model. Machine learning classification algorithms is implemented through key steps such as sample construction, model training, parameter optimization and forecast output to explore the feasibility of different types of algorithms in identifying landslides-causing rainfall processes. A training sample set construction method based on the positive samples, the negative samples are obtained by sampling under spatial-temporal limitation. The evaluation of different machine learning classification algorithms using the sample set shows that linear discriminant analysis algorithm has the highest accuracy(0.863) and the best generalization ability(area under the receiver operating characteristic curve is 0.886) without over-fitting problem, followed by the logistic regression algorithm and the K-nearest neighbor algorithm. In the probabilistic forecasting test for the cases of rainfall-induced landslides in 2021, all of three algorithms can extract and learn the conditional features and have certain ability to identify the rainfall processes which induce landslides. K-nearest neighbor algorithms and logistic regression algorithms have a relatively large range of probabilistic forecasting high value areas, which are prone to false alarm results. The probability forecast of the linear discriminant analysis algorithms is more convergent in the range of the high value area, and it can extract local rainfall information better, but it outputs unnecessary low-value probability forecasts in non-rainfall central area. The rainfall-induced landslides probability prediction model based on the machine learning classification algorithm comprehensively considers the coupling effect of the underlying surface factor and the rainfall factor, which is better than the commonly used critical threshold model that assumes the occurrence of landslides in the forecast area is only related to rainfall. The application results show that the machine learning classification algorithm model makes up for the shortcomings of existing forecasting models that are less likely to reflect the influence of the surface environment, so it is an important way to improve the performance of landslides forecasting and warning.
Databáze: Directory of Open Access Journals