Coordination of communication in robot teams by reinforcement learning

Authors:
Darío Maravall;Javier de Lope;Raúl Domínguez
Affiliations:
Cognitive Robotics Group, Dept. of Artificial Intelligence, Universidad Politécnica de Madrid;Cognitive Robotics Group, Dept. of Artificial Intelligence, Universidad Politécnica de Madrid and Universidad Politécnica de Madrid;Cognitive Robotics Group, Dept. of Artificial Intelligence, Universidad Politécnica de Madrid
Venue:
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
Year:
2011

Citing 4
Cited 0

Ant Colony Optimization

Ant Colony Optimization
Self-emergence of a common lexicon by evolution in teams of autonomous agents

Neurocomputing
Self-emergence of lexicon consensus in a population of autonomous agents by means of evolutionary strategies

HAIS'10 Proceedings of the 5th international conference on Hybrid Artificial Intelligence Systems - Volume Part II
Emergence of self-organized symbol-based communication in artificial creatures

Cognitive Systems Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

In Multi-agent systems, the study of language and communication is an active field of research. In this paper we present the application of Reinforcement Learning (RL) to the self-emergence of a common lexicon in robot teams. By modeling the vocabulary or lexicon of each agent as an association matrix or look-up table that maps the meanings (i.e. the objects encountered by the robots or the states of the environment itself) into symbols or signals we check whether it is possible for the robot team to converge in an autonomous, decentralized way to a common lexicon by means of RL, so that the communication efficiency of the entire robot team is optimal. We have conducted several experiments aimed at testing whether it is possible to converge with RL to an optimal Saussurean Communication System.We have organized our experiments alongside two main lines: first, we have investigated the effect of the team size centered on teams of moderated size in the order of 5 and 10 individuals, typical of multi-robot systems. Second, and foremost, we have also investigated the effect of the lexicon size on the convergence results. To analyze the convergence of the robot team we have defined the team's consensus when all the robots (i.e. 100% of the population) share the same association matrix or lexicon. As a general conclusion we have shown that RL allows the convergence to lexicon consensus in a population of autonomous agents.