Tuning Local Search by Average-Reward Reinforcement Learning

  • Authors:
  • Steven Prestwich

  • Affiliations:
  • Cork Constraint Computation Centre Department of Computer Science, University College, Cork, Ireland

  • Venue:
  • Learning and Intelligent Optimization
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Reinforcement Learning and local search have been combined in a variety of ways, in order to learn how to solve combinatorial problems more efficiently. Most approaches optimise the total reward, where the reward at each action is the change in objective function. We argue that it is more appropriate to optimise the average reward. We use R-learning to dynamically tune noise in standard SAT local search algorithms on single instances. Experiments show that noise can be successfully automated in this way.