Reinforcement Learning in Swarms that Learn

Authors:
James F. Peters;Christopher Henry;Sheela Ramanna
Affiliations:
University of Manitoba Department of Electrical and Computer Engineering 15 Gillson St., Winnipeg, Manitoba, Canada R3T 5V6;University of Manitoba Department of Electrical and Computer Engineering 15 Gillson St., Winnipeg, Manitoba, Canada R3T 5V6;Department of Applied Computer Science University of Winnipeg Winnipeg, Manitoba R3B 2E9 Canada
Venue:
IAT '05 Proceedings of the IEEE/WIC/ACM International Conference on Intelligent Agent Technology
Year:
2005

Citing 0
Cited 6

Approximation spaces in off-policy Monte Carlo learning

Engineering Applications of Artificial Intelligence
Biologically-inspired adaptive learning control strategies: A rough set approach

International Journal of Hybrid Intelligent Systems
Measuring Resemblances Between Swarm Behaviours: A Perceptual Tolerance Near Set Approach

Fundamenta Informaticae - Swarm Intelligence
A Swarm-Based Learning Method Inspired by Social Insects

ICIC '07 Proceedings of the 3rd International Conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence
Measuring Resemblances Between Swarm Behaviours: A Perceptual Tolerance Near Set Approach

Fundamenta Informaticae - Swarm Intelligence
Interactive information systems: Toward perception based computing

Theoretical Computer Science

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces an approach to reinforcement learning by cooperating agents using a variation of the actor critic method. This is made possible by considering behavior patterns of swarms in the context of approximation spaces. Rough set theory introduced by Zdzis - law Pawlak in 1982 provides a ground for deriving pattern-based rewards within approximation spaces. The framework provided by an approximation space makes it possible to derive pattern-based reference rewards used to estimate action preferences. Approximation spaces are used to derive action-based reference rewards at the swarm intelligence level. Two different forms of the actor critic reinforcement learning method are considered as a part of a study of learning in realtime by a swarm. The contribution of this article is the presentation of a new actor critic method defined in the context of approximation spaces. An ecosystem designed to facilitate study of reinforcement learning by swarms is briefly described. In addition, the results of ecosystem experiments for two forms of the actor critic method are given.