Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning
Artificial Intelligence
Lyapunov design for safe reinforcement learning
The Journal of Machine Learning Research
Dynamic abstraction in reinforcement learning via clustering
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Segmentation of human body parts using deformable triangulation
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans - Special issue on recent advances in biometrics
Handoff self-management based on SNR in mobile communication networks
International Journal of Wireless and Mobile Computing
A multidimensional scaling localisation algorithm based on bacterial foraging optimisation
International Journal of Wireless and Mobile Computing
An efficient service selection framework for pervasive environments
International Journal of Wireless and Mobile Computing
Hi-index | 0.00 |
Aimed at the situation of fitting variable environments in similarity imitation of stepping upstairs for humanoid robot, the process of designing similarity imitation was analysed as well as the rhythm control of key posture in similar locomotion. The simplified locomotion model and retargeting, connection of motion phases, and ZMP stability constraints were shown in similarity locomotion. The hierarchical reinforcement learning MaxQ was used to adjust the movement tracks of similarity to fit the variable stairs. The experiments showed the validity of the method.