A cross entropy multiagent learning algorithm for solving vehicle routing problems with time windows

  • Authors:
  • Tai-Yu Ma

  • Affiliations:
  • LET, ISH, Lyon Cedex

  • Venue:
  • ICCL'11 Proceedings of the Second international conference on Computational logistics
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The vehicle routing problem with time windows (VRPTW) has been the subject of intensive study because of its importance in real applications. In this paper, we propose a cross entropy multiagent learning algorithm, which considers an optimum solution as a rare event to be learned. The routing policy is node-distributed, controlled by a set of parameterized probability distribution functions. Based on the performance of experienced tours of vehicle agents, these parameters are updated iteratively by minimizing Kullback-Leibler cross entropy in order to generate better solutions in next iterations. When applying the proposed algorithm on Solomon's 100-customer problem set, it shows outperforming results in comparison with the classical cross entropy approach. Moreover, this method needs only very small number of parameter settings. Its implementation is also relatively simple and flexible to solve other vehicle routing problems under various dynamic scenarios.