Application of the nested rollout policy adaptation algorithm to the traveling salesman problem with time windows

  • Authors:
  • Tristan Cazenave;Fabien Teytaud

  • Affiliations:
  • LAMSADE, Université Paris Dauphine, France;LAMSADE, Université Paris Dauphine, France,HEC Paris, CNRS, Jouy-en-Josas, France

  • Venue:
  • LION'12 Proceedings of the 6th international conference on Learning and Intelligent Optimization
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we are interested in the minimization of the travel cost of the traveling salesman problem with time windows. In order to do this minimization we use a Nested Rollout Policy Adaptation (NRPA) algorithm. NRPA has multiple levels and maintains the best tour at each level. It consists in learning a rollout policy at each level. We also show how to improve the original algorithm with a modified rollout policy that helps NRPA to avoid time windows violations.