Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Cooperative Mobile Robotics: Antecedents and Directions
Autonomous Robots
Coordination for Multi-Robot Exploration and Mapping
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Cooperative multi-robot box-pushing
IROS '95 Proceedings of the International Conference on Intelligent Robots and Systems-Volume 3 - Volume 3
A selection-mutation model for q-learning in multi-agent systems
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Efficient learning equilibrium
Artificial Intelligence
An adaptive pursuit strategy for allocating operator probabilities
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Empirical Studies in Action Selection with Reinforcement Learning
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Dynamic analysis of multiagent Q-learning with ε-greedy exploration
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Accelerating reinforcement learning through implicit imitation
Journal of Artificial Intelligence Research
General game learning using knowledge transfer
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Rational and convergent learning in stochastic games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Emergence of flocking behavior based on reinforcement learning
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
A new Q-learning algorithm based on the metropolis criterion
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Hi-index | 0.00 |
This paper evaluates the performances of the reported q-learning policies for multi-agent systems. A set of extensively used policies were identified in the open literature namely greedy, ε-greedy, Boltzmann Distribution, Simulated Annealing and Probabiliy Matching. Five agents are modeled to search and retrieve pucks back to a home location in the environment under specified constraints. A number of simulation-based experiments was conducted and based on the numerical results that was obtained, the performances of the learning policies are discussed.