Elevator Group Control Using Multiple Reinforcement Learning Agents

Authors:
Robert H. Crites;Andrew G. Barto
Affiliations:
Unica Technologies, Inc., 55 Old Bedford Road, Lincoln, MA 01773. rcrites@unica-usa.com;University of Massachusetts, Department of Computer Science, Amherst, MA 01003. barto@cs.umass.edu
Venue:
Machine Learning
Year:
1998

Citing 17
Cited 42

Adaptive signal processing

Adaptive signal processing
Learning automata: an introduction

Learning automata: an introduction
From Chemotaxis to cooperativity: abstract exercises in neuronal learning strategies

The computing neuron
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations
Practical Issues in Temporal Difference Learning

Machine Learning
A dynamic load balancing approach to the control of multi-server polling systems with applications to elevator system dispatching

A dynamic load balancing approach to the control of multi-server polling systems with applications to elevator system dispatching
TD-Gammon, a self-teaching backgammon program, achieves master-level play

Neural Computation
Learning to act using real-time dynamic programming

Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Temporal difference learning and TD-Gammon

Communications of the ACM
Some studies in machine learning using the game of checkers

Computers & thought
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Feudal Reinforcement Learning

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Adaptation and Learning in Multi-Agent Systems: Some Remarks and a Bibliography

IJCAI '95 Proceedings of the Workshop on Adaption and Learning in Multi-Agent Systems
A Distributed Reinforcement Learning Scheme for Network Routing

A Distributed Reinforcement Learning Scheme for Network Routing
Large-scale dynamic optimization using teams of reinforcement learning agents

Large-scale dynamic optimization using teams of reinforcement learning agents
Algorithms for sequential decision-making

Algorithms for sequential decision-making

Hierarchical multi-agent reinforcement learning

Proceedings of the fifth international conference on Autonomous agents
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
A Topic-Specific Web Robot Model Based on Restless Bandits

IEEE Internet Computing
Learning to play strong poker

Machines that learn to play games
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning

ATAL '00 Proceedings of the 7th International Workshop on Intelligent Agents VII. Agent Theories Architectures and Languages
Spatiotemporal Abstraction of Stochastic Sequential Processes

Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
Lyapunov design for safe reinforcement learning

The Journal of Machine Learning Research
Learning to Communicate and Act Using Hierarchical Reinforcement Learning

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Coordinating Multiple Agents via Reinforcement Learning

Autonomous Agents and Multi-Agent Systems
A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game

Machine Learning
Online-optimization of multi-elevator transport systems with reoptimization algorithms based on set-partitioning models

Discrete Applied Mathematics - Special issue: Traces of the Latin American conference on combinatorics, graphs and applications: a selection of papers from LACGA 2004, Santiago, Chile
Neural-based downlink scheduling algorithm for broadband wireless networks

Computer Communications
Evolutionary Function Approximation for Reinforcement Learning

The Journal of Machine Learning Research
Empirical Studies in Action Selection with Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A Graph-Based Evolutionary Algorithm: Genetic Network Programming (GNP) and Its Extension Using Reinforcement Learning

Evolutionary Computation
Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation

Neural Computation
Application of reinforcement learning to the game of Othello

Computers and Operations Research
Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

International Journal of Knowledge-based and Intelligent Engineering Systems
Hierarchical Average Reward Reinforcement Learning

The Journal of Machine Learning Research
Reinforcement learning for problems with symmetrical restricted states

Robotics and Autonomous Systems
Sample-efficient evolutionary function approximation for reinforcement learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Accelerating reinforcement learning through implicit imitation

Journal of Artificial Intelligence Research
Risk-sensitive reinforcement learning applied to control under constraints

Journal of Artificial Intelligence Research
Natural actor-critic algorithms

Automatica (Journal of IFAC)
Brief paper: Development of a self-tuning fuzzy logic controller for intelligent control of elevator systems

Engineering Applications of Artificial Intelligence
Adaptive data-aware utility-based scheduling in resource-constrained systems

Adaptive data-aware utility-based scheduling in resource-constrained systems
RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments

The Journal of Machine Learning Research
Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Autonomous Agents and Multi-Agent Systems
Extending adaptive fuzzy behavior hierarchies to multiple levels of composite behaviors

Robotics and Autonomous Systems
Adaptive data-aware utility-based scheduling in resource-constrained systems

Journal of Parallel and Distributed Computing
Intelligent negotiation behaviour model for an open railway access market

Expert Systems with Applications: An International Journal
Coordinated learning in multiagent MDPs with infinite state-space

Autonomous Agents and Multi-Agent Systems
Marginalizing out future passengers in group elevator control

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
An efficient control method for elevator group control system

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part II
Structural abstraction experiments in reinforcement learning

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Elevator group control by using talented algorithm

TAINN'05 Proceedings of the 14th Turkish conference on Artificial Intelligence and Neural Networks
Emergent consensus in decentralised systems using collaborative reinforcement learning

Self-star Properties in Complex Information Systems
A distributed learning control system for elevator groups

ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
A rapid sparsification method for kernel machines in approximate policy iteration

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Novel method for using Q-learning in small microcontrollers

Proceedings of the 51st ACM Southeast Conference
Reinforcement learning algorithms with function approximation: Recent advances and applications

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an incremental basis. They can be trained on the basis of real or simulated experiences, focusing their computation on areas of state space that are actually visited during control, making them computationally tractable on very large problems. If each member of a team of agents employs one of these algorithms, a new collective learning algorithm emerges for the team as a whole. In this paper we demonstrate that such collective RL algorithms can be powerful heuristic methods for addressing large-scale control problems.Elevator group control serves as our testbed. It is a difficult domain posing a combination of challenges not seen in most multi-agent learning research to date. We use a team of RL agents, each of which is responsible for controlling one elevator car. The team receives a global reward signal which appears noisy to each agent due to the effects of the actions of the other agents, the random nature of the arrivals and the incomplete observation of the state. In spite of these complications, we show results that in simulation surpass the best of the heuristic elevator control algorithms of which we are aware. These results demonstrate the power of multi-agent RL on a very large scale stochastic dynamic optimization problem of practical utility.