A generic architecture for adaptive agents based on reinforcement learning

Authors:
Philippe Preux;Samuel Delepoulle;Jean-Claude Darcheville
Affiliations:
Laboratoire d'Informatique du Littoral (LIL), UPRES-JE 2335, Université du Littoral Côte d'Opale, B.P. 719, 62228 Calais Cedex, France;Laboratoire d'Informatique du Littoral (LIL), UPRES-JE 2335, Université du Littoral Côte d'Opale, B.P. 719, 62228 Calais Cedex, France;Unité de Recherche sur l'Évolution des Comportements et des Apprentissages (URECA), UPRES-EA 1059, Université de Lille 3, B.P. 149, 59653 Villeneuve d'Ascq Cedex, France
Venue:
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Bio-inspired systems (BIS)
Year:
2004

Citing 13
Cited 9

What is cognitive and what is not cognitive?

SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
Temporal difference learning and TD-Gammon

Communications of the ACM
Some studies in machine learning using the game of checkers

Computers & thought
Multi-agent reinforcement learning: independent vs. cooperative agents

Readings in agents
Learning to do without cognition

Proceedings of the fifth international conference on simulation of adaptive behavior on From animals to animats 5
Cambrian intelligence: the early history of the new AI

Cambrian intelligence: the early history of the new AI
Robot Shaping: An Experiment in Behavior Engineering

Robot Shaping: An Experiment in Behavior Engineering
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Using Reinforcement Learning to Spider the Web Efficiently

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
On Multiagent Q-Learning in a Semi-Competitive Domain

IJCAI '95 Proceedings of the Workshop on Adaption and Learning in Multi-Agent Systems
Temporal Difference Model Reproduces Anticipatory Neural Activity

Neural Computation
Evolution and development of neural controllers for locomotion, gradient-following, and obstacle-avoidance in artificial insects

IEEE Transactions on Neural Networks

A fuzzy Actor-Critic reinforcement learning network

Information Sciences: an International Journal
H∞ reinforcement learning control of robot manipulators using fuzzy wavelet networks

Fuzzy Sets and Systems
A novel approach for multi-agent-based Intelligent Manufacturing System

Information Sciences: an International Journal
An agent-oriented approach to resolve scheduling optimization in intelligent manufacturing

Robotics and Computer-Integrated Manufacturing
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks

Information Sciences: an International Journal
Hessian matrix distribution for Bayesian policy gradient reinforcement learning

Information Sciences: an International Journal
Self-organizing state aggregation for architecture design of Q-learning

Information Sciences: an International Journal
Machine learning and agents

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
A multi-viewpoint system to support abductive reasoning

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present MAABAC, a generic model for building adaptive agents: they learn new behaviors by interacting with their environment. These agents adapt their behavior by way of reinforcement learning, namely temporal difference methods. MAABAC is presented in its generality and then, different instantiations of the generic model are presented and experiments are reported. These experiments show the strength of this way of learning.