Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection

  • Authors:
  • Ali Akramizadeh; Ahmad Afshar; Mohammad B. Menhaj

  • Affiliations:
  • Department of Electrical Engineering, Amir Kabir University of Technology, Tehran, Iran (all authors)

  • Venue:
  • Intelligent Data Analysis
  • Year:
  • 2011

Abstract

In this paper, two novel exploration strategies are proposed for n-person general-sum multiagent reinforcement learning with sequential action selection. The underlying learning process, called an extensive Markov game, is modeled as a sequence of extensive-form games with perfect information. We introduce an estimated value of taking an action with respect to the other agents' preferences, called the associative Q-value; these values are used to select actions probabilistically according to a Boltzmann distribution. Simulation results demonstrate the effectiveness of the proposed exploration strategies when used in our previously introduced extensive-Q learning methods. Given the complexity of existing methods for computing Nash equilibrium points, extensive-Q learning is more convenient for dynamic-task multiagent systems with more than two agents, provided sequential action selection among agents can be assumed.
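The Boltzmann (softmax) action selection mentioned in the abstract can be sketched as follows. This is a generic illustration of temperature-controlled exploration, not the paper's associative Q-value computation; the Q-values and temperature below are placeholder assumptions.

```python
import numpy as np

def boltzmann_select(q_values, temperature, rng=None):
    """Sample an action index with probability proportional to
    exp(Q(a) / temperature); higher temperature means more exploration."""
    rng = rng or np.random.default_rng(0)
    q = np.asarray(q_values, dtype=float)
    # Subtract the max before exponentiating for numerical stability.
    logits = (q - q.max()) / temperature
    probs = np.exp(logits)
    probs /= probs.sum()
    return rng.choice(len(q), p=probs)

# Illustrative Q-values for three actions (placeholder numbers).
q = [1.0, 2.0, 0.5]
# At low temperature, selection concentrates on the greedy action (index 1).
greedy_count = sum(boltzmann_select(q, temperature=0.05) == 1
                   for _ in range(1000))
```

As the temperature is annealed toward zero, the distribution sharpens toward greedy action selection, which is the usual way such a strategy trades exploration for exploitation over the course of learning.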