A reinforcement learning algorithm to minimize the mean tardiness of a single machine with controlled capacity

Authors:
Hadeel D. Idrees;Mahdy O. Sinnokrot;Sameh Al-Shihabi
Affiliations:
University of Jordan, Amman, Jordan;University of Jordan, Amman, Jordan;University of Jordan, Amman, Jordan
Venue:
Proceedings of the 38th conference on Winter simulation
Year:
2006

Citing 4
Cited 0

Average reward reinforcement learning: foundations, algorithms, and empirical results

Machine Learning - Special issue on reinforcement learning
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Application of reinforcement learning to multi-agent production scheduling

Application of reinforcement learning to multi-agent production scheduling
Reinforcement learning: a survey

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this work, we consider the problem of scheduling arriving jobs to a single machine where the objective is to minimize the mean tardiness. The scheduler has the option of reducing the processing time by half through the employment of an extra worker for an extra cost per job (setup cost). The scheduler can also choose from a number of dispatching rules. To find a good policy to be followed by the scheduler, we implemented a λ-SMART algorithm to do an on-line optimization for the studied system. The found policy is only optimal with respect to the state representation and set of actions available, however, we believe that the developed policies are easy to implement and would result in considerable savings as shown by the numerical experiments conducted.