A hybrid cognitive/reactive intelligent agent autonomous path planning technique in a networked-distributed unstructured environment for reinforcement learning

Authors:
Dalila B. Megherbi;Vikram Malayia
Affiliations:
CMINDS Research Center, Electrical & Computer Engineering Department, University of Massachusetts, Lowell, USA 01854;CMINDS Research Center, Electrical & Computer Engineering Department, University of Massachusetts, Lowell, USA 01854
Venue:
The Journal of Supercomputing
Year:
2012

Abstract

This paper proposes a path planning technique for autonomous agent(s) located in an unstructured, networked, distributed environment, where each agent has limited, incomplete knowledge of the environment. Each agent has only the knowledge available in the distributed memory of the computing node it is running on, and the agents share some of the information they learn over a distributed network. In particular, the environment is divided into several sectors, with each sector located on a separate distributed computing node. We consider hybrid reactive-cognitive agent(s) whose motion planning is based on a potential field model combined with reinforcement learning and boundary detection algorithms. Potential fields are used for fast convergence toward a path, while reinforcement learning is used to guarantee a variety of behavior and consistent convergence in a distributed environment. We show how the agent decision-making process is enhanced by the combination of the two techniques. Furthermore, path retracing is a challenging problem in a distributed environment, since the agent does not have complete knowledge of the environment. We propose a backtracking technique that keeps the distributed agent informed at all times of its path information and step count, including when it migrates from one node to another. Note that no node has knowledge of the entire global path from a source to a goal when the goal resides on a separate node. Each agent knows only a partial path (internal to a node) and the number of steps corresponding to the portion of the path it traversed while running on that node.
In particular, we show how each agent, starting in one of the many sectors with no initial knowledge of the environment and using the proposed distributed technique, develops its intelligence based on its experience and seamlessly discovers the shortest global path to the target, which is located on a different node, while avoiding any obstacle(s) it encounters on its way, including when transitioning and migrating from one distributed computing node to another. The agent(s) use a multiple-token-ring message passing interface (MPI) to perform internode communication. Finally, the experimental results show that single and multiple agents sharing the same goal and running on the same or different nodes successfully coordinate the sharing of their respective environment states/information to collaboratively perform their respective tasks. The results also show that information sharing among distributed agents increases by an order of magnitude the speed of convergence to the optimal shortest path to the goal, compared with the single-agent case or the multiagent case without information sharing.
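The combination of a potential field with reinforcement learning can be illustrated with a minimal single-node sketch. It is not the authors' implementation: the 4x4 sector, the obstacle position, the reward values, the weight beta, and all function names are assumptions chosen for illustration. The sketch uses tabular Q-learning and subtracts a weighted potential from each Q-value when choosing an action, so the field guides the agent before anything has been learned and the learned values take over afterwards. Node migration, boundary detection, and MPI communication are left out.

```python
import math
import random

SIZE = 4                      # hypothetical 4x4 sector on a single node
START, GOAL = (0, 0), (3, 3)
OBSTACLES = {(1, 1)}          # assumed obstacle layout
ACTIONS = [(0, 1), (1, 0), (0, -1), (-1, 0)]  # (d_row, d_col)


def potential(cell):
    """Attractive pull toward the goal plus short-range obstacle repulsion."""
    attract = math.dist(cell, GOAL)
    repel = sum(2.0 / math.dist(cell, o) ** 2
                for o in OBSTACLES if 0 < math.dist(cell, o) <= 1.5)
    return attract + repel


def step(cell, action):
    """Move one cell; blocked moves (walls, obstacles) leave the agent in place."""
    nxt = (cell[0] + action[0], cell[1] + action[1])
    if not (0 <= nxt[0] < SIZE and 0 <= nxt[1] < SIZE) or nxt in OBSTACLES:
        nxt = cell
    reward = 10.0 if nxt == GOAL else -1.0
    return nxt, reward


def choose(q, cell, rng, epsilon, beta=0.1):
    """Epsilon-greedy on the learned Q-value minus a weighted potential."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    return max(range(len(ACTIONS)),
               key=lambda a: q[cell][a] - beta * potential(step(cell, ACTIONS[a])[0]))


def train(episodes=3000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with potential-biased action selection."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(SIZE) for c in range(SIZE)}
    for _ in range(episodes):
        cell = START
        for _ in range(50):   # cap on steps per episode
            a = choose(q, cell, rng, epsilon)
            nxt, reward = step(cell, ACTIONS[a])
            target = reward if nxt == GOAL else reward + gamma * max(q[nxt])
            q[cell][a] += alpha * (target - q[cell][a])
            cell = nxt
            if cell == GOAL:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the learned policy without exploration and record the path."""
    rng = random.Random(0)    # never consulted because epsilon is 0
    path = [START]
    while path[-1] != GOAL and len(path) <= max_steps:
        a = choose(q, path[-1], rng, epsilon=0.0)
        path.append(step(path[-1], ACTIONS[a])[0])
    return path
```

Calling `greedy_path(train())` returns the list of cells visited from the start toward the goal. In this layout the start corner is a local minimum of the potential, because both free neighbours lie closer to the obstacle. The negative reward for each wasted move lowers the corresponding Q-values until the agent leaves the corner, which is one way to see why a learned component can complement a purely reactive field.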