Predicting train occupancies based on query logs and external data sources
Autor: | Olivier Janssens, Joachim Van Herwegen, Ruben Verborgh, Femke Ongenae, Pieter Colpaert, Filip De Turck, Erik Mannens, Gilles Vandewiele |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2017 |
Předmět: |
public transport
050210 logistics & transportation Technology and Engineering Occupancy business.industry Computer science Quality of service 05 social sciences 02 engineering and technology Linked data linked data computer.software_genre Transport engineering Set (abstract data type) Public transport 0502 economics and business 0202 electrical engineering electronic engineering information engineering Web application 020201 artificial intelligence & image processing IBCN Data mining business computer License predictive modeling |
Zdroj: | WWW'17 COMPANION : PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB Ghent University Academic Bibliography WWW (Companion Volume) |
Popis: | On dense railway networks -such as in Belgium-train travelers are frequently confronted with overly occupied trains, especially during peak hours. Crowdedness on trains leads to a deterioration in the quality of service and has a negative impact on the well-being of the passenger. In order to stimulate travelers to consider less crowded trains, the iRail project wants to show an occupancy indicator in their route planning applications by the means of predictive modeling. As there is no official occupancy data available, training data is obtained by crowd-sourcing using the iRail web app(1) and the mobile Railer application for iPhone(2). Users can indicate their departure & arrival station, at what time they took a train and classify the occupancy of that train into the classes: low, medium or high. While preliminary results on a limited dataset conclude that the models do not yet perform sufficiently well, we are convinced that with further research and a larger amount of data, our predictive model will be able to achieve higher predictive performances. All datasets used in the current research are, for that purpose, made publicly available under an open license on the iRail website(3) and in the form of a Kaggle competition(4). Moreover, an infrastructure is set up that automatically processes new logs submitted by users in order for our model to continuously learn. Occupancy predictions for future trains are made available through an api(5). |
Databáze: | OpenAIRE |
Externí odkaz: |