To date, reinforcement learning has mostly been studied on simple learning tasks, and the methods studied so far typically converge slowly. The purpose of this work is thus two-fold: 1) to investigate the utility of reinforcement learning in solving much more complicated learning tasks than previously studied, and 2) to investigate methods that will speed up reinforcement learning.

This paper compares eight reinforcement learning frameworks: adaptive heuristic critic (AHC) learning due to Sutton, Q-learning due to Watkins, and three extensions to each of these two basic methods for speeding up learning. The three extensions are experience replay, learning action models for planning, and teaching. The frameworks were investigated using connectionism as the approach to generalization. To evaluate the performance of the different frameworks, a dynamic environment was used as a testbed. The environment is moderately complex and nondeterministic. This paper describes these frameworks and algorithms in detail and presents an empirical evaluation of the frameworks.
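To make the idea of experience replay concrete, the following is a minimal sketch, not the paper's implementation. It assumes a tabular Q-function instead of the connectionist networks used in the paper, and a hypothetical deterministic chain environment instead of the paper's nondeterministic testbed. The function name, the environment, and all parameter values are illustrative assumptions. After each episode, the stored transitions are replayed several times in reverse temporal order, so that reward information propagates backwards faster than it would from online updates alone.

```python
import random
from collections import defaultdict


def q_learning_with_replay(n_states=6, episodes=200, alpha=0.5, gamma=0.9,
                           epsilon=0.1, replay_passes=5, max_steps=1000, seed=0):
    """Tabular Q-learning on a toy chain, with per-episode experience replay.

    Hypothetical environment: states 0..n_states-1 in a row, actions -1 (left)
    and +1 (right), reward 1.0 on reaching the rightmost state, else 0.0.
    """
    rng = random.Random(seed)
    actions = (-1, 1)
    goal = n_states - 1
    q = defaultdict(float)  # Q-values keyed by (state, action), default 0.0

    def update(state, action, reward, next_state):
        # One-step Q-learning backup; the goal state is terminal (value 0).
        future = 0.0 if next_state == goal else max(q[(next_state, a)] for a in actions)
        target = reward + gamma * future
        q[(state, action)] += alpha * (target - q[(state, action)])

    for _ in range(episodes):
        state = 0
        trajectory = []  # transitions experienced in this episode
        for _ in range(max_steps):
            if state == goal:
                break
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore
            else:
                best = max(q[(state, a)] for a in actions)
                # Greedy choice with random tie-breaking.
                action = rng.choice([a for a in actions if q[(state, a)] == best])
            next_state = min(max(state + action, 0), goal)
            reward = 1.0 if next_state == goal else 0.0
            trajectory.append((state, action, reward, next_state))
            update(state, action, reward, next_state)  # ordinary online update
            state = next_state

        # Experience replay: re-apply the stored transitions, newest first.
        for _ in range(replay_passes):
            for transition in reversed(trajectory):
                update(*transition)
    return q


if __name__ == "__main__":
    q_values = q_learning_with_replay()
    for s in range(5):
        print(s, round(q_values[(s, -1)], 3), round(q_values[(s, 1)], 3))
```

Setting `replay_passes=0` in this sketch gives plain online Q-learning, which offers a simple way to compare how quickly the value estimates spread along the chain with and without replay.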