Use Subsampling to Solve Imbalanced Dataset Problem for Automatic Incident Detection Algorithm

Autor: Miao Hua Li, Shu Yan Chen
Rok vydání: 2014
Předmět:
Zdroj: Applied Mechanics and Materials. :2114-2119
ISSN: 1662-7482
Popis: Considering the fact that the amount of traffic incident data is rare compared to the large amount of normal traffic state data in the real word, we proposed an Automatic Incident Detection (AID) algorithm based on subsampling method. First, an improved subsampling method based on Edited Nearest Neighbor Rule (ENN) algorithm was used to reconstruct the training set to get a balanced dataset. Then, the Support Vector Machine (SVM) was adopted as a classifier to detect traffic incidents. The real traffic data collected from the I-880 freeway in American was used to build the model and test the performance of the proposed AID algorithm. In addition, we made a comparison of the detection performances between the AID algorithm obtained by the original training set and the one by the relatively balanced training set. The experimental results show that the proposed AID algorithm based on subsampling is suitable for imbalanced dataset and can obtain a better detection performance.
Databáze: OpenAIRE