Model-Based 3D Hand Pose Estimation from Monocular Video

Authors:
Martin de La Gorce;David J. Fleet;Nikos Paragios
Affiliations:
Laboratoire MAS, Ecole Centrale de Paris, Chatenay-Malabry;University of Toronto, Toronto;Laboratoire MAS, Ecole Centrale de Paris, Chatenay-Malabry and INRIA Saclay - Ile-de-France, Orsay
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2011

Citing 0
Cited 9

A brief review of vision based hand gesture recognition

CSECS'11/MECHANICS'11 Proceedings of the 10th WSEAS international conference on Circuits, Systems, Electronics, Control & Signal Processing, and Proceedings of the 7th WSEAS international conference on Applied and Theoretical Mechanics
Motion capture of hands in action using discriminative salient points

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Hand pose estimation and hand shape classification using multi-layered randomized decision forests

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Combining marker-based mocap and RGB-D camera for acquiring high-fidelity hand motion data

EUROSCA'12 Proceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation
Combining marker-based mocap and RGB-D camera for acquiring high-fidelity hand motion data

Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Video-based hand manipulation capture through composite motion control

ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings
Recognizing hand gestures using the weighted elastic graph matching (WEGM) method

Image and Vision Computing
Gravity optimised particle filter for hand tracking

Pattern Recognition
Tracking hand rotation and various grasping gestures from an IR camera using extended cylindrical manifold embedding

Computer Vision and Image Understanding

Quantified Score

Hi-index	0.14

Visualization

Abstract

A novel model-based approach to 3D hand tracking from monocular video is presented. The 3D hand pose, the hand texture, and the illuminant are dynamically estimated through minimization of an objective function. Derived from an inverse problem formulation, the objective function enables explicit use of temporal texture continuity and shading information while handling important self-occlusions and time-varying illumination. The minimization is done efficiently using a quasi-Newton method, for which we provide a rigorous derivation of the objective function gradient. Particular attention is given to terms related to the change of visibility near self-occlusion boundaries that are neglected in existing formulations. To this end, we introduce new occlusion forces and show that using all gradient terms greatly improves the performance of the method. Qualitative and quantitative experimental results demonstrate the potential of the approach.