A dynamic, interpretable, and robust hybrid data analytics system for train movements in large-scale railway networks
Authors: | Luca Oneto, Alessandro Lulli, Irene Buselli, Simone Petralli, Renzo Canepa, Davide Anguita |
---|---|
Year of publication: | 2019 |
Subject: |
Experience-based models; 0301 basic medicine; Computer science; Machine learning; Running time; Train overtaking; Railway network; 03 medical and health sciences; 0302 clinical medicine; Robustness (computer science); Train movements; Overtaking; Train delays; Interpretability; Applied Mathematics; Dwell time; Hybrid models; Computer Science Applications; Management information systems; Data-driven models; Penalty costs; 030104 developmental biology; Computational Theory and Mathematics; Analytics; 030220 oncology & carcinogenesis; Modeling and Simulation; Data analysis; Train; Artificial intelligence; Scale (map); Information Systems |
Source: | International Journal of Data Science and Analytics |
ISSN: | 2364-4168 2364-415X |
DOI: | 10.1007/s41060-018-00171-z |
Description: | We investigate the problem of analysing train movements in large-scale railway networks in order to understand and predict their behaviour. We focus on five important aspects: the Running Time of a train between two stations, the Dwell Time of a train in a station, the Train Delay, the Penalty Costs associated with a delay, and the Train Overtaking between two trains that are in the wrong relative position on the railway network. Two main approaches to these problems exist in the literature. One is based on knowledge of the network and the experience of the operators. The other is based on the analysis of historical data about the network with advanced data analytics methods. In this paper, we propose a hybrid approach that addresses the limitations of current solutions. Experience-based models are interpretable and robust, but they cannot take into account all the factors that influence train movements, which results in low accuracy. Data-driven models, on the other hand, are usually neither easy to interpret nor robust to infrequent events, and they require a representative amount of data, which is not always available if the phenomenon under examination changes too fast. Results on real-world data from the Italian railway network show that the proposed solution outperforms both state-of-the-art experience-based and data-driven systems in terms of interpretability, robustness, ability to handle nonrecurring events and changes in the behaviour of the network, and ability to consider complex and exogenous information. |
Database: | OpenAIRE |