Predicting train occupancies based on query logs and external data sources

Authors: Olivier Janssens, Joachim Van Herwegen, Ruben Verborgh, Femke Ongenae, Pieter Colpaert, Filip De Turck, Erik Mannens, Gilles Vandewiele
Language: English
Year of publication: 2017
Subject:
Source: WWW'17 Companion: Proceedings of the 26th International Conference on World Wide Web
Ghent University Academic Bibliography
WWW (Companion Volume)
Description: On dense railway networks, such as Belgium's, train travelers are frequently confronted with overcrowded trains, especially during peak hours. Crowdedness on trains leads to a deterioration in the quality of service and has a negative impact on the well-being of passengers. In order to encourage travelers to consider less crowded trains, the iRail project wants to show an occupancy indicator in its route planning applications by means of predictive modeling. As there is no official occupancy data available, training data is obtained by crowd-sourcing using the iRail web app(1) and the mobile Railer application for iPhone(2). Users can indicate their departure and arrival stations and the time at which they took a train, and classify the occupancy of that train as low, medium or high. While preliminary results on a limited dataset show that the models do not yet perform sufficiently well, we are convinced that with further research and a larger amount of data, our predictive model will be able to achieve higher predictive performance. All datasets used in the current research are, for that purpose, made publicly available under an open license on the iRail website(3) and in the form of a Kaggle competition(4). Moreover, an infrastructure is set up that automatically processes new logs submitted by users so that our model can learn continuously. Occupancy predictions for future trains are made available through an API(5).
Database: OpenAIRE