This paper addresses path planning for a 1 cm³ mobile microrobot designed for microassembly in a microfactory. Because the conventional path planning method cannot achieve high microassembly positioning accuracy, a supervised learning assisted reinforcement learning (SL-RL) method has been developed. In this mixed learning method, reinforcement learning (RL) searches for a movement path in the normal learning area, but when the microrobot moves into the buffer area, supervised learning (SL) is employed to prevent it from crossing the boundary. SL-RL uses a gradient-descent algorithm based on uniform grid tile coding under SARSA(λ) to handle the large learning state space. In addition to the uniform grid tile model, two irregular tile models, an uneven grid tile model and a cobweb tile model, are designed to partition the microrobot state space. Simulations support three main conclusions: first, the SL-RL method achieves higher positioning accuracy than the conventional path planning method; second, the SL-RL method achieves higher positioning accuracy and learning efficiency than RL alone; and third, the irregular tile models learn more efficiently than the uniform tile model, with the cobweb tile model performing especially well.
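To make the combination of uniform grid tile coding and gradient-descent SARSA(λ) concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not specify the state variables, action set, number of tilings, trace type, or step sizes, so the 2-D position state, the class and function names (`UniformTileCoder`, `sarsa_lambda_update`), the use of replacing eligibility traces, and all parameter values are assumptions chosen for illustration only. The supervised-learning buffer-area rule and the irregular (uneven grid, cobweb) tile models are not covered here.

```python
import numpy as np


class UniformTileCoder:
    """Uniform grid tile coding over a box-shaped state space.

    Hypothetical setup: the state is assumed to be a 2-D position (x, y).
    Several offset tilings each contribute exactly one active tile.
    """

    def __init__(self, low, high, tiles_per_dim=8, n_tilings=4):
        self.low = np.asarray(low, dtype=float)
        self.high = np.asarray(high, dtype=float)
        self.n_tilings = n_tilings
        self.tile_width = (self.high - self.low) / tiles_per_dim
        # One extra tile per dimension so shifted tilings still cover the box.
        self.side = tiles_per_dim + 1
        self.tiles_per_tiling = self.side ** len(self.low)
        self.n_features = n_tilings * self.tiles_per_tiling

    def active_tiles(self, state):
        """Return one active feature index per tiling for the given state."""
        state = np.clip(np.asarray(state, dtype=float), self.low, self.high)
        indices = []
        for t in range(self.n_tilings):
            # Each tiling is shifted by a fraction of one tile width.
            offset = self.tile_width * t / self.n_tilings
            coords = np.floor((state - self.low + offset) / self.tile_width)
            coords = np.minimum(coords.astype(int), self.side - 1)
            flat = 0
            for c in coords:
                flat = flat * self.side + int(c)
            indices.append(t * self.tiles_per_tiling + flat)
        return indices


def q_value(w, tiles, action):
    """Linear action value: sum of the weights of the active tiles."""
    return w[action, tiles].sum()


def sarsa_lambda_update(w, z, tiles, action, reward, next_tiles, next_action,
                        done, alpha=0.1, gamma=0.95, lam=0.9):
    """One gradient-descent SARSA(lambda) step, updating w and z in place.

    Replacing traces are an assumption; the source does not name a trace type.
    """
    target = reward
    if not done:
        target += gamma * q_value(w, next_tiles, next_action)
    delta = target - q_value(w, tiles, action)
    z *= gamma * lam            # decay every eligibility trace
    z[action, tiles] = 1.0      # replacing traces on the active features
    w += (alpha / len(tiles)) * delta * z
    return delta


# Usage sketch with made-up numbers: two actions on a unit-square workspace.
coder = UniformTileCoder(low=[0.0, 0.0], high=[1.0, 1.0])
w = np.zeros((2, coder.n_features))
z = np.zeros_like(w)
tiles = coder.active_tiles([0.5, 0.5])
delta = sarsa_lambda_update(w, z, tiles, action=0, reward=1.0,
                            next_tiles=tiles, next_action=0, done=True)
```

Dividing the step size by the number of active tiles keeps the effective learning rate independent of how many tilings are used, which is a common convention with tile coding rather than something stated in the source.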