Speeding up learning automata based multi agent systems using the concepts of stigmergy and entropy

Authors:
Behrooz Masoumi;M. R. Meybodi
Affiliations:
Department of Computer Engineering and Information Technology, Islamic Azad University, Qazvin Branch, Qazvin, Iran;Soft Computing Laboratory, Computer Engineering and Information Technology Department, Amirkabir University of Technology, Tehran, Iran
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 22
Cited 2

Learning automata: an introduction

Learning automata: an introduction
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
A brief history of stigmergy

Artificial Life
Stigmergy, self-organization, and sorting in collective robotics

Artificial Life
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Multiagent Systems: A Survey from a Machine Learning Perspective

Autonomous Robots
A Practitioners' Review of Industrial Agent Applications

Autonomous Agents and Multi-Agent Systems
Multi-agent Coordination and Control Using Stigmergy Applied to Manufacturing Control

EASSS '01 Selected Tutorial Papers from the 9th ECCAI Advanced Course ACAI 2001 and Agent Link's 3rd European Agent Systems Summer School on Multi-Agent Systems and Applications
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
A Pheromone-Based Utility Model for Collaborative Foraging

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1
Multi-agent learning model with bargaining

Proceedings of the 38th conference on Winter simulation
Exploring selfish reinforcement learning in repeated games with stochastic rewards

Autonomous Agents and Multi-Agent Systems
Evaluating Learning Automata as a Model for Cooperation in Complex Multi-agent Domains

RoboCup 2006: Robot Soccer World Cup X
Strategy Entropy as a Measure of Strategy Convergence in Reinforcement Learning

ICINIS '08 Proceedings of the 2008 First International Conference on Intelligent Networks and Intelligent Systems
A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence

A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence
Analyzing stigmergetic algorithms through automata games

KDECB'06 Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics
Pareto-Q learning algorithm for cooperative agents in general-sum games

CEEMAS'05 Proceedings of the 4th international Central and Eastern European conference on Multi-Agent Systems and Applications
Learning automata as a basis for multi agent reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
A Comprehensive Survey of Multiagent Reinforcement Learning

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Varieties of learning automata: an overview

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Colonies of learning automata

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Decentralized Learning in Markov Games

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Machine learning and agents

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Adaptive learning algorithm of self-organizing teams

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	12.05

Visualization

Abstract

Learning automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms and are able to control the stochastic games. In this paper, the concepts of stigmergy and entropy are imported into learning automata based multi-agent systems with the purpose of providing a simple framework for interaction and coordination in multi-agent systems and speeding up the learning process. The multi-agent system considered in this paper is designed to find optimal policies in Markov games. We consider several dummy agents that walk around in the states of the environment, make local learning automaton active, and bring information so that the involved learning automaton can update their local state. The entropy of the probability vector for the learning automata of the next state is used to determine reward or penalty for the actions of learning automata. The experimental results have shown that in terms of the speed of reaching the optimal policy, the proposed algorithm has better learning performance than other learning algorithms.