Practical Issues in Temporal Difference Learning
Machine Learning
Intelligent scheduling
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Reinforcement Learning Applied to Linear Quadratic Regulation
Advances in Neural Information Processing Systems 5, [NIPS Conference]
Efficient training of artificial neural networks for autonomous navigation
Neural Computation
Learning to predict user operations for adaptive scheduling
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Reinforced Genetic Programming
Genetic Programming and Evolvable Machines
ADORE: Adaptive Object Recognition
ICVS '99 Proceedings of the First International Conference on Computer Vision Systems
An Overview of MAXQ Hierarchical Reinforcement Learning
SARA '02 Proceedings of the 4th International Symposium on Abstraction, Reformulation, and Approximation
How to Design Good Results for Multiple Learning Agents in Scheduling Problems?
PRIMA '99 Proceedings of the Second Pacific Rim International Workshop on Multi-Agents: Approaches to Intelligent Agents
Different Local Search Algorithms in STAGE for Solving Bin Packing Problem
EurAsia-ICT '02 Proceedings of the First EurAsian Conference on Information and Communication Technology
Imitation and Reinforcement Learning in Agents with Heterogeneous Actions
AI '01 Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
An introduction to reinforcement learning theory: value function methods
Advanced lectures on machine learning
Learning evaluation functions to improve optimization by local search
The Journal of Machine Learning Research
Speedup learning for repair-based search by identifying redundant steps
The Journal of Machine Learning Research
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Proceedings of the 34th conference on Winter simulation: exploring new frontiers
Active preference learning for personalized calendar scheduling assistance
Proceedings of the 10th international conference on Intelligent user interfaces
A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning
Discrete Event Dynamic Systems
Evolutionary Function Approximation for Reinforcement Learning
The Journal of Machine Learning Research
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Artificial Intelligence
Cooperation learning in Multi-Agent Systems with annotation and reward
International Journal of Knowledge-based and Intelligent Engineering Systems
Hierarchical Average Reward Reinforcement Learning
The Journal of Machine Learning Research
Finite-Time Bounds for Fitted Value Iteration
The Journal of Machine Learning Research
Ensemble clustering with voting active clusters
Pattern Recognition Letters
Learning While Optimizing an Unknown Fitness Surface
Learning and Intelligent Optimization
Tuning Local Search by Average-Reward Reinforcement Learning
Learning and Intelligent Optimization
Reinforcement Learning: A Tutorial Survey and Recent Advances
INFORMS Journal on Computing
Multi-robot task allocation through vacancy chain scheduling
Robotics and Autonomous Systems
Learning Representation and Control in Markov Decision Processes: New Frontiers
Foundations and Trends® in Machine Learning
Solving concurrent Markov decision processes
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Adaptive sampling based large-scale stochastic resource control
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Solving factored MDPs with hybrid state and action variables
Journal of Artificial Intelligence Research
Planning with durative actions in stochastic domains
Journal of Artificial Intelligence Research
Adaptive stochastic resource control: a machine learning approach
Journal of Artificial Intelligence Research
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
Infinite-horizon policy-gradient estimation
Journal of Artificial Intelligence Research
Experiments with infinite-horizon, policy-gradient estimation
Journal of Artificial Intelligence Research
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Discriminative learning of beam-search heuristics for planning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Application of reinforcement learning for agent-based production scheduling
Engineering Applications of Artificial Intelligence
Transfer Learning for Reinforcement Learning Domains: A Survey
The Journal of Machine Learning Research
A reinforcement learning framework for combinatorial optimization
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
Machine learning for intelligent systems
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Scheduling English football fixtures over the holiday period using hyper-heuristics
PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part I
Reinforcement learning based resource allocation in business process management
Data & Knowledge Engineering
A real-time job-shop scheduling method based on adaptive agent
ROCOM'06 Proceedings of the 6th WSEAS international conference on Robotics, control and manufacturing technology
Policy learning in resource-constrained optimization
Proceedings of the 13th annual conference on Genetic and evolutionary computation
The Journal of Machine Learning Research
CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
Supervised learning linear priority dispatch rules for job-shop scheduling
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Learning heuristic policies – a reinforcement learning problem
LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
Transfer in reinforcement learning via shared features
The Journal of Machine Learning Research
A rapid sparsification method for kernel machines in approximate policy iteration
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Design with shape grammars and reinforcement learning
Advanced Engineering Informatics
Monte-Carlo tree search for Bayesian reinforcement learning
Applied Intelligence
Learning via human feedback in continuous state and action spaces
Applied Intelligence
Reinforcement learning algorithms with function approximation: Recent advances and applications
Information Sciences: an International Journal
Hi-index | 0.00 |
We apply reinforcement learning methods to learn domain-specific heuristics for job shop scheduling. A repair-based scheduler starts with a critical-path schedule and incrementally repairs constraint violations with the goal of finding a short conflict-free schedule. The temporal difference algorithm TD(λ) is applied to tram a neural network to learn a heuristic evaluation function over states. This evaluation function is used by a one-step lookahead search procedure to find good solutions to new scheduling problems. We evaluate this approach on synthetic problems and on problems from a NASA space shuttle pay load processing task. The evaluation function is trained on problems involving a small number of jobs and then tested on larger problems. The TD scheduler performs better than the best known existing algorithm for this task--Zwehen's iterative repair method based on simulated annealing. The results suggest that reinforcement learning can provide a new method for constructing high-performance scheduling systems.