Improving biped walk stability using real-time corrective human feedback

  • Authors:
  • Çetin Meriçli; Manuela Veloso

  • Affiliations:
  • Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, and Computer Engineering Department, Boğaziçi University, Bebek, Istanbul, Turkey; Computer Science Department, Carnegie Mellon University, Pittsburgh, PA

  • Venue:
  • RoboCup 2010
  • Year:
  • 2011

Abstract

Robust walking is one of the key requirements for soccer-playing humanoid robots. Developing such a biped walk algorithm is non-trivial due to the complex dynamics of the walk process. In this paper, we first present a method for learning a corrective closed-loop policy that improves walk stability for the Aldebaran Nao robot by combining real-time human feedback with an open-loop walk cycle. The open-loop walk cycle is obtained from joint commands recorded while the robot walks using an existing walk algorithm treated as a black-box unit. We capture the corrective feedback signals delivered by a human through a wireless feedback mechanism, in the form of corrections to particular joints, and we present experimental results showing that a policy learned from one walk algorithm can be used to improve the stability of another walk algorithm. We then extend this approach by improving the open-loop walk cycle using advice operators before performing a real-time human demonstration. During the demonstration, we capture the sensory readings and the corrections, given as displacements of the foot positions, while the robot executes the improved open-loop walk cycle. We translate the foot displacement values into individual correction signals for the leg joints using a simplified inverse kinematics calculation, and we use a locally weighted linear regression method to learn a mapping from the recorded sensor values to the correction values. Finally, we apply a simple anomaly detection method: the changes in the sensory readings throughout the walk cycle during a stable walk are modeled as normal distributions, and the correction policy is executed only when a sensory reading deviates from the modeled values. Experimental results demonstrate an improvement in the walk stability.
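
The abstract describes two computational pieces: a locally weighted linear regression that maps sensor readings to joint corrections, and an anomaly detector that models the sensor readings at each phase of a stable walk cycle as normal distributions and gates when the corrections are applied. The sketch below illustrates how such a policy could fit together; it is not the authors' implementation, and the class name CorrectivePolicy, the Gaussian kernel bandwidth, and the 3-sigma anomaly threshold are illustrative assumptions.

```python
import numpy as np


class CorrectivePolicy:
    """Sketch of a sensor-to-correction mapping with anomaly-gated execution.

    The structure follows the abstract; the kernel bandwidth, the 3-sigma
    threshold, and all names here are illustrative assumptions.
    """

    def __init__(self, demo_sensors, demo_corrections, stable_cycles,
                 bandwidth=1.0, sigma_threshold=3.0):
        # demo_sensors:     (N, D) sensor readings logged during demonstration
        # demo_corrections: (N, K) leg-joint corrections derived from the
        #                   demonstrated foot displacements via inverse kinematics
        # stable_cycles:    (C, T, D) sensor readings from C cycles of a stable walk
        self.X = np.asarray(demo_sensors, dtype=float)
        self.Y = np.asarray(demo_corrections, dtype=float)
        self.bandwidth = bandwidth
        self.sigma_threshold = sigma_threshold
        stable = np.asarray(stable_cycles, dtype=float)
        # Normal-distribution model of each sensor at each phase of the cycle.
        self.mu = stable.mean(axis=0)              # (T, D) per-phase means
        self.sigma = stable.std(axis=0) + 1e-8     # avoid division by zero

    def _lwr(self, x):
        # Locally weighted linear regression: weight demonstration points by a
        # Gaussian kernel centred on the query, then solve a weighted
        # least-squares problem for a local linear model with a bias term.
        w = np.exp(-np.sum((self.X - x) ** 2, axis=1) / (2.0 * self.bandwidth ** 2))
        sw = np.sqrt(w)[:, None]
        Xa = np.hstack([self.X, np.ones((len(self.X), 1))])
        beta, *_ = np.linalg.lstsq(sw * Xa, sw * self.Y, rcond=None)
        return np.append(x, 1.0) @ beta            # (K,) correction vector

    def correction(self, sensors, phase):
        # Execute the learned correction only if some sensor reading at this
        # phase of the walk cycle falls outside the stable-walk model.
        x = np.asarray(sensors, dtype=float)
        z = np.abs(x - self.mu[phase]) / self.sigma[phase]
        if np.any(z > self.sigma_threshold):
            return self._lwr(x)
        return np.zeros(self.Y.shape[1])           # stable: no correction needed
```

In such a setup, the controller would add correction(sensor_reading, phase) to the open-loop joint commands at each step of the walk cycle; while the anomaly gate stays closed, the robot simply replays the recorded open-loop walk.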