COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS
Cybernetics and Systems
Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging
Journal of Intelligent and Robotic Systems
Hi-index | 0.00 |
An improved reinforcement learning algorithm is proposed in this paper. This algorithm is based on linear programming method for finding the best-response policy. A pursuit example is tested and the results show that this algorithm has some properties, such as easy computation, simple operation procedure and can guarantee an good learning convergence.