A reinforcement learning approach to job-shop scheduling

Authors:
Wei Zhang;Thomas G. Dietterich
Affiliations:
Department of Computer Science, Oregon State University, Corvallis, Oregon;Department of Computer Science, Oregon State University, Corvallis, Oregon
Venue:
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Year:
1995

Citing 5
Cited 58

Practical Issues in Temporal Difference Learning

Machine Learning
Intelligent scheduling

Intelligent scheduling
Learning to Predict by the Methods of Temporal Differences

Machine Learning
Reinforcement Learning Applied to Linear Quadratic Regulation

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Efficient training of artificial neural networks for autonomous navigation

Neural Computation

Explanation-Based Learning and Reinforcement Learning: A Unified View

Machine Learning
Learning to predict user operations for adaptive scheduling

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Reinforced Genetic Programming

Genetic Programming and Evolvable Machines
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

Machine Learning
ADORE: Adaptive Object Recognition

ICVS '99 Proceedings of the First International Conference on Computer Vision Systems
An Overview of MAXQ Hierarchical Reinforcement Learning

SARA '02 Proceedings of the 4th International Symposium on Abstraction, Reformulation, and Approximation
How to Design Good Results for Multiple Learning Agents in Scheduling Problems?

PRIMA '99 Proceedings of the Second Pacific Rim International Workshop on Multi-Agents: Approaches to Intelligent Agents
Different Local Search Algorithms in STAGE for Solving Bin Packing Problem

EurAsia-ICT '02 Proceedings of the First EurAsian Conference on Information and Communication Technology
Imitation and Reinforcement Learning in Agents with Heterogeneous Actions

AI '01 Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
An introduction to reinforcement learning theory: value function methods

Advanced lectures on machine learning
Learning evaluation functions to improve optimization by local search

The Journal of Machine Learning Research
Speedup learning for repair-based search by identifying redundant steps

The Journal of Machine Learning Research
Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
General methodology 1: optimising discrete event simulation models using a reinforcement learning agent

Proceedings of the 34th conference on Winter simulation: exploring new frontiers
Active preference learning for personalized calendar scheduling assistance

Proceedings of the 10th international conference on Intelligent user interfaces
A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning

Discrete Event Dynamic Systems
Evolutionary Function Approximation for Reinforcement Learning

The Journal of Machine Learning Research
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence
Cooperation learning in Multi-Agent Systems with annotation and reward

International Journal of Knowledge-based and Intelligent Engineering Systems
Hierarchical Average Reward Reinforcement Learning

The Journal of Machine Learning Research
Finite-Time Bounds for Fitted Value Iteration

The Journal of Machine Learning Research
Ensemble clustering with voting active clusters

Pattern Recognition Letters
Learning While Optimizing an Unknown Fitness Surface

Learning and Intelligent Optimization
Tuning Local Search by Average-Reward Reinforcement Learning

Learning and Intelligent Optimization
QL2, a simple reinforcement learning scheme for two-player zero-sum Markov games

Neurocomputing
Reinforcement Learning: A Tutorial Survey and Recent Advances

INFORMS Journal on Computing
Multi-robot task allocation through vacancy chain scheduling

Robotics and Autonomous Systems
Learning Representation and Control in Markov Decision Processes: New Frontiers

Foundations and Trends® in Machine Learning
Solving concurrent Markov decision processes

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Adaptive sampling based large-scale stochastic resource control

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Solving factored MDPs with hybrid state and action variables

Journal of Artificial Intelligence Research
Planning with durative actions in stochastic domains

Journal of Artificial Intelligence Research
Adaptive stochastic resource control: a machine learning approach

Journal of Artificial Intelligence Research
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
Infinite-horizon policy-gradient estimation

Journal of Artificial Intelligence Research
Experiments with infinite-horizon, policy-gradient estimation

Journal of Artificial Intelligence Research
A neural reinforcement learning approach to learn local dispatching policies in production scheduling

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Discriminative learning of beam-search heuristics for planning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Application of reinforcement learning for agent-based production scheduling

Engineering Applications of Artificial Intelligence
Improving iterative repair strategies for scheduling with the SVM

Neurocomputing
Transfer Learning for Reinforcement Learning Domains: A Survey

The Journal of Machine Learning Research
A reinforcement learning framework for combinatorial optimization

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
Machine learning for intelligent systems

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Scheduling English football fixtures over the holiday period using hyper-heuristics

PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part I
Reinforcement learning based resource allocation in business process management

Data & Knowledge Engineering
A real-time job-shop scheduling method based on adaptive agent

ROCOM'06 Proceedings of the 6th WSEAS international conference on Robotics, control and manufacturing technology
Policy learning in resource-constrained optimization

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Generalized TD Learning

The Journal of Machine Learning Research
Stochastic reactive production scheduling by multi-agent based asynchronous approximate dynamic programming

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
Supervised learning linear priority dispatch rules for job-shop scheduling

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Learning heuristic policies – a reinforcement learning problem

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Transfer in reinforcement learning via shared features

The Journal of Machine Learning Research
A rapid sparsification method for kernel machines in approximate policy iteration

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Design with shape grammars and reinforcement learning

Advanced Engineering Informatics
A learning approach to optimizing exploration---exploitation tradeoff in relevance feedback

Information Retrieval
Monte-Carlo tree search for Bayesian reinforcement learning

Applied Intelligence
Learning via human feedback in continuous state and action spaces

Applied Intelligence
Reinforcement learning algorithms with function approximation: Recent advances and applications

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

We apply reinforcement learning methods to learn domain-specific heuristics for job shop scheduling. A repair-based scheduler starts with a critical-path schedule and incrementally repairs constraint violations with the goal of finding a short conflict-free schedule. The temporal difference algorithm TD(λ) is applied to tram a neural network to learn a heuristic evaluation function over states. This evaluation function is used by a one-step lookahead search procedure to find good solutions to new scheduling problems. We evaluate this approach on synthetic problems and on problems from a NASA space shuttle pay load processing task. The evaluation function is trained on problems involving a small number of jobs and then tested on larger problems. The TD scheduler performs better than the best known existing algorithm for this task--Zwehen's iterative repair method based on simulated annealing. The results suggest that reinforcement learning can provide a new method for constructing high-performance scheduling systems.