Classification of Livebus arrivals user behavior |
| |
Authors: | Natalia Selini Hadjidimitriou Marco Mamei Mauro Dell'Amico Ioannis Kaparias |
| |
Affiliation: | 1. Department of Sciences and Methods for Engineering, University of Modena and Reggio Emilia, Reggio Emilia, Italy;2. Faculty of Engineering and the Environment, University of Southampton, Southampton, UK |
| |
Abstract: | With the increasing use of Intelligent Transport Systems, large amounts of data are created. Innovative information services are introduced and new forms of data are available, which could be used to understand the behavior of travelers and the dynamics of people flows. This work analyzes the requests for real-time arrivals of bus routes at stops in London made by travelers using Transport for London's LiveBus Arrivals system. The available dataset consists of about one million requests for real-time arrivals for each of the 28 days under observation. These data are analyzed for different purposes. LiveBus Arrivals users are classified based on a set of features and using K-Means, Expectation Maximization, Logistic regression, One-level decision tree, Decision Tree, Random Forest, and Support Vector Machine (SVM) by Sequential Minimal Optimization (SMO). The results of the study indicate that the LiveBus Arrivals requests can be classified into six main behaviors. It was found that the classification-based approaches produce better results than the clustering-based ones. The most accurate results were obtained with the SVM-SMO methodology (Precision of 97%). Furthermore, the behavior within the six classes of users is analyzed to better understand how users take advantage of the LiveBus Arrivals service. It was found that the 37% of users can be classified as interchange users. This classification could form the basis of a more personalized LiveBus Arrivals application in future, which could support management and planning by revealing how public transport and related services are actually used or update information on commuters. |
| |
Keywords: | Clustering algorithms data-driven behavior data mining intelligent transport system public transport real time information supervised classification |
|
|