Spatial prediction of landslide susceptibility using data mining-based kernel logistic regression, naive Bayes and RBFNetwork models for the Long County area (China)
Autor: | Wei Chen, Dieu Tien Bui, Haoyuan Hong, Biswajeet Pradhan, Zhou Zhao, Xusheng Yan |
---|---|
Rok vydání: | 2018 |
Předmět: |
Variance inflation factor
Radial basis function network Mean squared error 0211 other engineering and technologies Geology Landslide Feature selection 02 engineering and technology 010502 geochemistry & geophysics Geotechnical Engineering and Engineering Geology computer.software_genre 01 natural sciences Normalized Difference Vegetation Index Naive Bayes classifier Multicollinearity Data mining computer 021101 geological & geomatics engineering 0105 earth and related environmental sciences Mathematics |
Zdroj: | Bulletin of Engineering Geology and the Environment. 78:247-266 |
ISSN: | 1435-9537 1435-9529 |
DOI: | 10.1007/s10064-018-1256-z |
Popis: | The main goal of this study is to assess and compare three advanced machine learning techniques, namely, kernel logistic regression (KLR), naive Bayes (NB), and radial basis function network (RBFNetwork) models for landslide susceptibility modeling in Long County, China. First, a total of 171 landslide locations were identified within the study area using historical reports, aerial photographs, and extensive field surveys. All the landslides were randomly separated into two parts with a ratio of 70/30 for training and validation purposes. Second, 12 landslide conditioning factors were prepared for landslide susceptibility modeling, including slope aspect, slope angle, plan curvature, profile curvature, elevation, distance to faults, distance to rivers, distance to roads, lithology, NDVI (normalized difference vegetation index), land use, and rainfall. Third, the correlations between the conditioning factors and the occurrence of landslides were analyzed using normalized frequency ratios. A multicollinearity analysis of the landslide conditioning factors was carried out using tolerances and variance inflation factor (VIF) methods. Feature selection was performed using the chi-squared statistic with a 10-fold cross-validation technique to assess the predictive capabilities of the landslide conditioning factors. Then, the landslide conditioning factors with null predictive ability were excluded in order to optimize the landslide models. Finally, the trained KLR, NB, and RBFNetwork models were used to construct landslide susceptibility maps. The receiver operating characteristics (ROC) curve, the area under the curve (AUC), and several statistical measures, such as accuracy (ACC), F-measure, mean absolute error (MAE), and root mean squared error (RMSE), were used for the assessment, validation, and comparison of the resulting models in order to choose the best model in this study. The validation results show that all three models exhibit reasonably good performance, and the KLR model exhibits the most stable and best performance. The KLR model, which has a success rate of 0.847 and a prediction rate of 0.749, is a promising technique for landslide susceptibility mapping. Given the outcomes of the study, all three models could be used efficiently for landslide susceptibility analysis. |
Databáze: | OpenAIRE |
Externí odkaz: |