Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

Authors:
Long-Ji Lin
Affiliations:
School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213. ljl@cs.cmu.edu
Venue:
Machine Learning
Year:
1992

Citing 0
Cited 81

CHILD: A First Step Towards Continual Learning

Machine Learning - Special issue on inductive transfer
Explanation-Based Learning and Reinforcement Learning: A Unified View

Machine Learning
Reactive search, a history-sensitive heuristic for MAX-SAT

Journal of Experimental Algorithmics (JEA)
Adaptive information agents in distributed textual environments

AGENTS '98 Proceedings of the second international conference on Autonomous agents
Adaptive Retrieval Agents: Internalizing Local Contextand Scaling up to the Web

Machine Learning - Special issue on information retrieval
Knowledge extraction from reinforcement learning

New learning paradigms in soft computing
Reinforcement learning for fuzzy agents: application to a pighouse environment control

New learning paradigms in soft computing
Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network

Neural Processing Letters
A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making

Applied Intelligence
Robot Awareness in Cooperative Mobile Robot Learning

Autonomous Robots
Embedding a Priori Knowledge in Reinforcement Learning

Journal of Intelligent and Robotic Systems
Relational Reinforcement Learning

Machine Learning
Continuous-Action Q-Learning

Machine Learning
Robot learning driven by emotions

Adaptive Behavior
Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning

ECML '02 Proceedings of the 13th European Conference on Machine Learning
A Reinforcement Learning with Condition Reduced Fuzz Rules

SEAL'98 Selected papers from the Second Asia-Pacific Conference on Simulated Evolution and Learning on Simulated Evolution and Learning
Minimax Fuzzy Q-Learning in Cooperative Multi-agent Systems

ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
Modular-Fuzzy Cooperation Algorithm for Multi-agent Systems

ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
Q-Learning in Continuous State and Action Spaces

AI '99 Proceedings of the 12th Australian Joint Conference on Artificial Intelligence: Advanced Topics in Artificial Intelligence
Introduction to Sequence Learning

Sequence Learning - Paradigms, Algorithms, and Applications
Continual Robot Learning with Constructive Neural Networks

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Complementing search engines with online web mining agents

Decision Support Systems - Special issue: Web data mining
Distributed Reinforcement Learning Control for Batch Sequencing and Sizing in Just-In-Time Manufacturing Systems

Applied Intelligence
Using relative novelty to identify useful temporal abstractions in reinforcement learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Dynamic abstraction in reinforcement learning via clustering

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Integrating Guidance into Relational Reinforcement Learning

Machine Learning
Learning from Multiple Sources

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Improving the Learning Rate by Inducing a Transition Model

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Teaching robots to plan through Q-learning

Robotica
Identifying useful subgoals in reinforcement learning by local graph partitioning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Online Evolution for a Self-Adapting Robotic Navigation System Using Evolvable Hardware

Artificial Life
A Neural Learning Classifier System with Self-Adaptive Constructivism for Mobile Robot Control

Artificial Life
Evolutionary Function Approximation for Reinforcement Learning

The Journal of Machine Learning Research
Empirical Studies in Action Selection with Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Application of SONQL for real-time learning of robot behaviors

Robotics and Autonomous Systems
A layered approach to learning coordination knowledge in multiagent environments

Applied Intelligence
Batch reinforcement learning in a complex domain

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence
An architecture based on emotions for growing up artefacts

AIC'05 Proceedings of the 5th WSEAS International Conference on Applied Informatics and Communications
Real-time dynamic fuzzy Q-learning and control of mobile robots

ICECS'03 Proceedings of the 2nd WSEAS International Conference on Electronics, Control and Signal Processing
Accelerated Neural Evolution through Cooperatively Coevolved Synapses

The Journal of Machine Learning Research
The utility of temporal abstraction in reinforcement learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Autonomous agent learning using an actor-critic algorithm and behavior models

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
A Connectionist Architecture for Learning to Play a Simulated Brio Labyrinth Game

KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
Model-Based Reinforcement Learning in a Complex Domain

RoboCup 2007: Robot Soccer World Cup XI
Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach

Expert Systems with Applications: An International Journal
Using temporal-difference learning for multi-agent bargaining

Electronic Commerce Research and Applications
Agent-Based Connection Control for Digital Content Service

KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Imitation guided learning in learning classifier systems

Natural Computing: an international journal
A Hierarchical Autonomous Robot Controller for Learning and Memory: Adaptation in a Dynamic Environment

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Using Strongly Connected Components as a Basis for Autonomous Skill Acquisition in Reinforcement Learning

ISNN '09 Proceedings of the 6th International Symposium on Neural Networks on Advances in Neural Networks
Reinforcement learning for robot soccer

Autonomous Robots
Sample-efficient evolutionary function approximation for reinforcement learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Efficient reinforcement learning using recursive least-squares methods

Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation

Journal of Artificial Intelligence Research
On partially controlled multi-agent systems

Journal of Artificial Intelligence Research
Truncating temporal differences: on the efficient implementation of TD (λ) for reinforcement learning

Journal of Artificial Intelligence Research
State similarity based approach for improving performance in RL

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning to act using real-time dynamic programming

Artificial Intelligence
Evolution and incremental learning in the iterated prisoner's dilemma

IEEE Transactions on Evolutionary Computation
Multiagent Reinforcement Learning with Spiking and Non-Spiking Agents in the Iterated Prisoner's Dilemma

ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
Cooperative learning using advice exchange

Adaptive agents and multi-agent systems
To teach or not to teach?: decision making under uncertainty in ad hoc teams

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Auto-exploratory average reward reinforcement learning

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
An innovative routing algorithm with reinforcement learning and pattern tree adjustment for wireless sensor networks

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume Part III
A real-time job-shop scheduling method based on adaptive agent

ROCOM'06 Proceedings of the 6th WSEAS international conference on Robotics, control and manufacturing technology
Exploiting Best-Match Equations for Efficient Reinforcement Learning

The Journal of Machine Learning Research
Automatic discovery of subgoals based on improved FCM clustering

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Making use of unelaborated advice to improve reinforcement learning: a mobile robotics approach

ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Neural fitted q iteration – first experiences with a data efficient neural reinforcement learning method

ECML'05 Proceedings of the 16th European conference on Machine Learning
Using advice to transfer knowledge acquired in one reinforcement learning task to another

ECML'05 Proceedings of the 16th European conference on Machine Learning
Learning skills in reinforcement learning using relative novelty

SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Effectiveness of considering state similarity for reinforcement learning

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Two-step gradient-based reinforcement learning for underwater robotics behavior learning

Robotics and Autonomous Systems
Teaching on a budget: agents advising agents in reinforcement learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game

Engineering Applications of Artificial Intelligence
Teaching and leading an ad hoc teammate: Collaboration without pre-coordination

Artificial Intelligence
Employing batch reinforcement learning to control gene regulation without explicitly constructing gene regulatory networks

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

Automatica (Journal of IFAC)
Automatic skill acquisition in reinforcement learning using graph centrality measures

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

To date, reinforcement learning has mostly been studied solving simple learning tasks. Reinforcement learning methods that have been studied so far typically converge slowly. The purpose of this work is thus two-fold: 1) to investigate the utility of reinforcement learning in solving much more complicated learning tasks than previously studied, and 2) to investigate methods that will speed up reinforcement learning.This paper compares eight reinforcement learning frameworks: adaptive heuristic critic (AHC) learning due to Sutton, Q-learning due to Watkins, and three extensions to both basic methods for speeding up learning. The three extensions are experience replay, learning action models for planning, and teaching. The frameworks were investigated using connectionism as an approach to generalization. To evaluate the performance of different frameworks, a dynamic environment was used as a testbed. The environment is moderately complex and nondeterministic. This paper describes these frameworks and algorithms in detail and presents empirical evaluation of the frameworks.