Route Optimization Using Q-Learning for On-Demand Bus Systems

  • Authors:
  • Naoto Mukai;Toyohide Watanabe;Jun Feng

  • Affiliations:
  • Department of Electrical Engineering, Tokyo University of Science, Tokyo, Japan 102-0073;Department of Systems and Social Informatics, Graduate School of Information Science, Department of Electrical Engineering, Nagoya University, Nagoya, Japan 464-8603;Hohai University, Nanjing, China 210098

  • Venue:
  • KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we focus on a new transport service called on-demand bus system. A major feature of the system is that buses pick up customers door-to-door when needed or required. Thus, there is no pre-determined travel routes for buses, and travel routes must be changed according to the occurrence frequency of customers. In order to find a more effective travel plan to the problem, we adopt Q-learning which is one of the machine learning algorithms. However, native Q-learning is inadequate to our target problem because the number of customers at pick-up points is time-dependent. Therefore, we improve an update process of Q values and a selection process of the next pick-up point, on the basis of time passage parameters. In particular, rewards are understated in update process, on the other hand, Q values are overstated in selection process. At the last, we report our simulation results and show the effectiveness of our algorithm for the problem.