Reinforcement based mobile robot navigation in dynamic environment

  • Authors:
  • Mohammad Abdel Kareem Jaradat;Mohammad Al-Rousan;Lara Quadan

  • Affiliations:
  • Mechanical Engineering Department, Faculty of Engineering, Jordan University of Science & Technology, Irbid 22110, Jordan;Computer Engineering Departments, Faculty of Computer & Information Technology, Jordan University of Science & Technology, Irbid 22110, Jordan;Computer Engineering Departments, Faculty of Computer & Information Technology, Jordan University of Science & Technology, Irbid 22110, Jordan

  • Venue:
  • Robotics and Computer-Integrated Manufacturing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a new approach is developed for solving the problem of mobile robot path planning in an unknown dynamic environment based on Q-learning. Q-learning algorithms have been used widely for solving real world problems, especially in robotics since it has been proved to give reliable and efficient solutions due to its simple and well developed theory. However, most of the researchers who tried to use Q-learning for solving the mobile robot navigation problem dealt with static environments; they avoided using it for dynamic environments because it is a more complex problem that has infinite number of states. This great number of states makes the training for the intelligent agent very difficult. In this paper, the Q-learning algorithm was applied for solving the mobile robot navigation in dynamic environment problem by limiting the number of states based on a new definition for the states space. This has the effect of reducing the size of the Q-table and hence, increasing the speed of the navigation algorithm. The conducted experimental simulation scenarios indicate the strength of the new proposed approach for mobile robot navigation in dynamic environment. The results show that the new approach has a high Hit rate and that the robot succeeded to reach its target in a collision free path in most cases which is the most desirable feature in any navigation algorithm.