What is cognitive and what is not cognitive?
SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
Temporal difference learning and TD-Gammon
Communications of the ACM
Some studies in machine learning using the game of checkers
Computers & thought
Multi-agent reinforcement learning: independent vs. cooperative agents
Readings in agents
Learning to do without cognition
Proceedings of the fifth international conference on simulation of adaptive behavior on From animals to animats 5
Cambrian intelligence: the early history of the new AI
Cambrian intelligence: the early history of the new AI
Robot Shaping: An Experiment in Behavior Engineering
Robot Shaping: An Experiment in Behavior Engineering
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Using Reinforcement Learning to Spider the Web Efficiently
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
On Multiagent Q-Learning in a Semi-Competitive Domain
IJCAI '95 Proceedings of the Workshop on Adaption and Learning in Multi-Agent Systems
Temporal Difference Model Reproduces Anticipatory Neural Activity
Neural Computation
IEEE Transactions on Neural Networks
A fuzzy Actor-Critic reinforcement learning network
Information Sciences: an International Journal
H∞ reinforcement learning control of robot manipulators using fuzzy wavelet networks
Fuzzy Sets and Systems
A novel approach for multi-agent-based Intelligent Manufacturing System
Information Sciences: an International Journal
An agent-oriented approach to resolve scheduling optimization in intelligent manufacturing
Robotics and Computer-Integrated Manufacturing
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks
Information Sciences: an International Journal
Hessian matrix distribution for Bayesian policy gradient reinforcement learning
Information Sciences: an International Journal
Self-organizing state aggregation for architecture design of Q-learning
Information Sciences: an International Journal
KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
A multi-viewpoint system to support abductive reasoning
Information Sciences: an International Journal
Hi-index | 0.00 |
In this paper, we present MAABAC, a generic model for building adaptive agents: they learn new behaviors by interacting with their environment. These agents adapt their behavior by way of reinforcement learning, namely temporal difference methods. MAABAC is presented in its generality and then, different instantiations of the generic model are presented and experiments are reported. These experiments show the strength of this way of learning.