The paper investigates stochastic resource allocation problems with scarce, reusable resources and non-preemptive, time-dependent, interconnected tasks. This formulation naturally generalizes several standard resource management problems, such as scheduling and transportation problems. First, reactive solutions are considered and defined as control policies of suitably reformulated Markov decision processes (MDPs). We argue that this reformulation has several favorable properties: it has finite state and action spaces, it is aperiodic (hence all policies are proper), and the space of control policies can be safely restricted. Next, approximate dynamic programming (ADP) methods, such as fitted Q-learning, are suggested for computing an efficient control policy. To maintain the cost-to-go function compactly, two representations are studied: hash tables and support vector regression (SVR), in particular ν-SVR. Several additional improvements are also investigated, such as the application of limited-lookahead rollout algorithms in the initial phases, action space decomposition, task clustering, and distributed sampling. Finally, experimental results on both benchmark and industry-related data are presented.
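To illustrate the general idea of fitted Q-learning with an SVR-based cost-to-go representation, the following is a minimal sketch on a toy finite MDP using scikit-learn's `NuSVR`. The random MDP, the one-hot features, and all parameter values are illustrative assumptions, not the paper's actual resource allocation model.

```python
# Minimal fitted Q-iteration sketch: ν-SVR approximates the cost-to-go
# function over (state, action) features. Toy MDP only; the real problem
# in the paper has structured, task-dependent states.
import numpy as np
from sklearn.svm import NuSVR

rng = np.random.default_rng(0)
n_states, n_actions, gamma = 10, 3, 0.9

# Random transition kernel P[s, a] (a distribution over next states)
# and immediate costs in [0, 1] -- purely synthetic data.
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))
cost = rng.uniform(0.0, 1.0, size=(n_states, n_actions))

def features(s, a):
    """One-hot encode the (state, action) pair for the regressor."""
    x = np.zeros(n_states + n_actions)
    x[s] = 1.0
    x[n_states + a] = 1.0
    return x

# Collect a batch of sampled transitions (s, a, s').
batch = [(s, a, rng.choice(n_states, p=P[s, a]))
         for s in range(n_states)
         for a in range(n_actions)
         for _ in range(20)]
X = np.array([features(s, a) for s, a, _ in batch])

Q = np.zeros((n_states, n_actions))  # bootstrap table for the targets
for _ in range(30):                  # fitted Q-iteration loop
    # Bellman targets: immediate cost + discounted best cost-to-go at s'.
    y = np.array([cost[s, a] + gamma * Q[s2].min() for s, a, s2 in batch])
    model = NuSVR(nu=0.5, C=10.0).fit(X, y)   # fit ν-SVR to the targets
    # Refresh the bootstrap table from the fitted regressor.
    Q = model.predict(
        np.array([features(s, a)
                  for s in range(n_states) for a in range(n_actions)])
    ).reshape(n_states, n_actions)

policy = Q.argmin(axis=1)  # greedy (cost-minimizing) policy
```

The regressor is refit from scratch on each sweep, which mirrors the batch ("fitted") variant of Q-learning; a hash-table representation would simply replace `model` with exact per-pair storage.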