Route Optimization Using Q-Learning for On-Demand Bus Systems

Authors:
Naoto Mukai;Toyohide Watanabe;Jun Feng
Affiliations:
Department of Electrical Engineering, Tokyo University of Science, Tokyo, Japan 102-0073;Department of Systems and Social Informatics, Graduate School of Information Science, Department of Electrical Engineering, Nagoya University, Nagoya, Japan 464-8603;Hohai University, Nanjing, China 210098
Venue:
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Year:
2008

Citing 2
Cited 0

Technical Note: \cal Q-Learning

Machine Learning
On the convergence of stochastic iterative dynamic programming algorithms

Neural Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we focus on a new transport service called on-demand bus system. A major feature of the system is that buses pick up customers door-to-door when needed or required. Thus, there is no pre-determined travel routes for buses, and travel routes must be changed according to the occurrence frequency of customers. In order to find a more effective travel plan to the problem, we adopt Q-learning which is one of the machine learning algorithms. However, native Q-learning is inadequate to our target problem because the number of customers at pick-up points is time-dependent. Therefore, we improve an update process of Q values and a selection process of the next pick-up point, on the basis of time passage parameters. In particular, rewards are understated in update process, on the other hand, Q values are overstated in selection process. At the last, we report our simulation results and show the effectiveness of our algorithm for the problem.