Crash Severity Analysis of Highways Based on Multinomial Logistic Regression Model, Decision Tree Techniques, and Artificial Neural Network: A Modeling Comparison
Author: | Gholamreza Shiran, Razieh Khayamim, Reza Imaninasab |
---|---|
Language: | English |
Year of publication: | 2021 |
Subject: |
Computer science; Geography, Planning and Development; Decision tree; TJ807-830; Crash; 02 engineering and technology; Management, Monitoring, Policy and Law; TD194-195; Renewable energy sources; 0502 economics and business; Statistics; 0202 electrical engineering, electronic engineering, information engineering; GE1-350; Multinomial logistic regression; 050210 logistics & transportation; Variables; Artificial neural network; crash severity; Environmental effects of industries and plants; Renewable Energy, Sustainability and the Environment; 05 social sciences; Statistical model; CHAID; Environmental sciences; decision tree techniques; multinomial logistic regression model; 020201 artificial intelligence & image processing; Predictive modelling |
Source: | Sustainability, Vol 13, Iss 10, Article 5670 (2021) |
ISSN: | 2071-1050 |
Description: | Classifying vehicular crashes by severity is crucial because crashes differ in their financial and injury costs. In addition, accurate prediction modeling makes it possible to avoid crashes by identifying the factors that influence them. Crash severity analysis therefore requires accurate and time-saving prediction models. Moreover, statistical models are incapable of identifying the potential severity of crashes with respect to the influencing factors incorporated in the models. Unlike previous research efforts, which applied data mining models to a limited set of crash severity classes (property damage only (PDO), fatality, and injury), the present study sought to predict crash frequency according to five severity levels: PDO, fatality, severe injury, other visible injuries, and complaint of pain. The multinomial logistic regression (MLR) model and data mining approaches, including an artificial neural network-multilayer perceptron (ANN-MLP) and two decision tree techniques (i.e., Chi-square automatic interaction detector (CHAID) and C5.0), were applied to traffic crash records for State Highways in California, USA. A comparison of the relative importance of the ten qualitative and ten quantitative independent variables incorporated in CHAID and C5.0 indicated that the cause of the crash (X1) and the number of vehicles (X5) were the most influential variables. However, the ANN-MLP model identified the cause of the crash (X1) and weather (X2) as the most contributing variables. In addition, the MLR model showed that the driver’s age (X11) accounts for a larger proportion of traffic crash severity. The sensitivity analysis demonstrated that C5.0 had the best performance for predicting road crash severity.
C5.0 not only ran faster (0.05 s) than CHAID, MLP, and MLR but also achieved the highest accuracy on the training set. Its overall prediction accuracy on the training data was approximately 88.09%, compared with 77.21% and 70.21% for the CHAID and MLP models, respectively. In general, the findings of this study revealed that C5.0 can be a promising tool for predicting road crash severity. |
Database: | OpenAIRE |
External link: |