Improving biped walk stability using real-time corrective human feedback

  • Authors:
  • Çetin Meriçli; Manuela Veloso

  • Affiliations:
  • Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, and Computer Engineering Department, Boğaziçi University, Bebek, Istanbul, Turkey; Computer Science Department, Carnegie Mellon University, Pittsburgh, PA

  • Venue:
  • RoboCup 2010
  • Year:
  • 2011

Abstract

Robust walking is one of the key requirements for soccer-playing humanoid robots. Developing such a biped walk algorithm is non-trivial due to the complex dynamics of the walk process. In this paper, we first present a method for learning a corrective closed-loop policy that improves walk stability for the Aldebaran Nao robot by combining real-time human feedback with an open-loop walk cycle. The open-loop walk cycle is obtained from joint commands recorded while the robot walks using an existing walk algorithm treated as a black-box unit. We capture the corrective feedback signals delivered by a human through a wireless feedback mechanism, in the form of corrections to particular joints, and we present experimental results showing that a policy learned from one walk algorithm can be used to improve the stability of another walk algorithm. We then extend this approach by improving the open-loop walk cycle using advice operators before performing a real-time human demonstration. During the demonstration, we capture the sensory readings and the corrections, given as displacements of the foot positions, while the robot executes the improved open-loop walk cycle. We translate the foot displacement values into individual correction signals for the leg joints using a simplified inverse kinematics calculation, and we use a locally weighted linear regression method to learn a mapping from the recorded sensor values to the correction values. Finally, we apply a simple anomaly detection method: the changes in the sensory readings throughout the walk cycle during a stable walk are modeled as normal distributions, and the correction policy is executed only when a sensory reading deviates from the modeled values. Experimental results demonstrate an improvement in the walk stability.
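
The abstract describes two computational pieces: a locally weighted linear regression that maps sensor readings to joint corrections, and an anomaly detector that models the sensor readings at each phase of a stable walk cycle as normal distributions and gates when the corrections are applied. The sketch below illustrates how such a policy could fit together; it is not the authors' implementation, and the class name CorrectivePolicy, the Gaussian kernel bandwidth, and the 3-sigma anomaly threshold are illustrative assumptions.

```python
import numpy as np


class CorrectivePolicy:
    """Sketch of a sensor-to-correction mapping with anomaly-gated execution.

    The structure follows the abstract; the kernel bandwidth, the 3-sigma
    threshold, and all names here are illustrative assumptions.
    """

    def __init__(self, demo_sensors, demo_corrections, stable_cycles,
                 bandwidth=1.0, sigma_threshold=3.0):
        # demo_sensors:     (N, D) sensor readings logged during demonstration
        # demo_corrections: (N, K) leg-joint corrections derived from the
        #                   demonstrated foot displacements via inverse kinematics
        # stable_cycles:    (C, T, D) sensor readings from C cycles of a stable walk
        self.X = np.asarray(demo_sensors, dtype=float)
        self.Y = np.asarray(demo_corrections, dtype=float)
        self.bandwidth = bandwidth
        self.sigma_threshold = sigma_threshold
        stable = np.asarray(stable_cycles, dtype=float)
        # Normal-distribution model of each sensor at each phase of the cycle.
        self.mu = stable.mean(axis=0)              # (T, D) per-phase means
        self.sigma = stable.std(axis=0) + 1e-8     # avoid division by zero

    def _lwr(self, x):
        # Locally weighted linear regression: weight demonstration points by a
        # Gaussian kernel centred on the query, then solve a weighted
        # least-squares problem for a local linear model with a bias term.
        w = np.exp(-np.sum((self.X - x) ** 2, axis=1) / (2.0 * self.bandwidth ** 2))
        sw = np.sqrt(w)[:, None]
        Xa = np.hstack([self.X, np.ones((len(self.X), 1))])
        beta, *_ = np.linalg.lstsq(sw * Xa, sw * self.Y, rcond=None)
        return np.append(x, 1.0) @ beta            # (K,) correction vector

    def correction(self, sensors, phase):
        # Execute the learned correction only if some sensor reading at this
        # phase of the walk cycle falls outside the stable-walk model.
        x = np.asarray(sensors, dtype=float)
        z = np.abs(x - self.mu[phase]) / self.sigma[phase]
        if np.any(z > self.sigma_threshold):
            return self._lwr(x)
        return np.zeros(self.Y.shape[1])           # stable: no correction needed
```

In such a setup, the controller would add correction(sensor_reading, phase) to the open-loop joint commands at each step of the walk cycle; while the anomaly gate stays closed, the robot simply replays the recorded open-loop walk.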