Learning large margin likelihoods for realtime head pose tracking

Authors:
Elisa Ricci; Jean-Marc Odobez
Affiliations:
Idiap Research Institute, Martigny, Switzerland; Idiap Research Institute, Martigny, Switzerland
Venue:
ICIP'09: Proceedings of the 16th IEEE International Conference on Image Processing
Year:
2009


Abstract

We consider the problem of realtime head tracking and pose estimation from low-resolution images. Tracking and pose recognition are treated as two coupled problems in a probabilistic framework: a template-based algorithm with multiple pose-specific reference models jointly determines the position and scale of the target and its head pose. The target is represented with Histograms of Oriented Gradients (HOG), descriptors that are robust to varying illumination, fast to compute, and discriminative with respect to pose. To improve pose recognition accuracy, we define the likelihood as a parameterized function and propose to learn it from training data with a new discriminative approach based on the large-margin paradigm. The performance of the learning algorithm and of the tracker is evaluated on public image and video databases.
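
The abstract does not state the form of the parameterized likelihood or the training objective, so the following is only an illustrative sketch under assumptions of our own, not the authors' method. It assumes the likelihood of a descriptor under a pose template is exp(-weighted squared distance), with one non-negative weight per descriptor dimension, and it learns those weights by plain stochastic subgradient descent on a regularized hinge loss that asks the true pose to beat the most likely wrong pose by a margin. All function names, parameters, and the toy data are hypothetical.

```python
import math
import random


def block_distances(descriptor, template):
    """Per-dimension squared differences between a descriptor and a template."""
    return [(a - b) ** 2 for a, b in zip(descriptor, template)]


def weighted_distance(weights, descriptor, template):
    """Weighted sum of the per-dimension distances (the learned part)."""
    return sum(w * d for w, d in zip(weights, block_distances(descriptor, template)))


def likelihood(weights, descriptor, template):
    """Unnormalized likelihood of a descriptor under one pose template (assumed form)."""
    return math.exp(-weighted_distance(weights, descriptor, template))


def predict_pose(weights, descriptor, templates):
    """Index of the pose template with the highest likelihood."""
    return max(range(len(templates)),
               key=lambda k: likelihood(weights, descriptor, templates[k]))


def train_large_margin(samples, templates, epochs=50, lr=0.1, reg=0.01,
                       margin=1.0, seed=0):
    """Learn non-negative weights so that the true pose is closer than any
    wrong pose by at least `margin`. Needs at least two templates.

    samples: list of (descriptor, true_pose_index) pairs.
    """
    rng = random.Random(seed)
    weights = [1.0] * len(templates[0])  # start from an unweighted distance
    order = list(samples)
    for _ in range(epochs):
        rng.shuffle(order)
        for descriptor, true_pose in order:
            d_true = block_distances(descriptor, templates[true_pose])
            # Wrong pose that currently looks most likely (smallest distance).
            rival = min((k for k in range(len(templates)) if k != true_pose),
                        key=lambda k: weighted_distance(weights, descriptor, templates[k]))
            d_rival = block_distances(descriptor, templates[rival])
            # Hinge term: positive when the margin constraint is violated.
            violation = margin + sum(w * (t - r)
                                     for w, t, r in zip(weights, d_true, d_rival))
            for j in range(len(weights)):
                grad = reg * weights[j]
                if violation > 0:
                    grad += d_true[j] - d_rival[j]
                # Gradient step, then projection onto non-negative weights.
                weights[j] = max(0.0, weights[j] - lr * grad)
    return weights


# Toy data (made up): dimension 0 carries the pose, dimension 1 is misleading.
templates = [[0.0, 0.0], [1.0, 1.0]]
samples = [([0.2, 0.9], 0), ([0.8, 0.1], 1)]
weights = train_large_margin(samples, templates)
print(weights)
print([predict_pose(weights, z, templates) for z, _ in samples])
```

On this toy data the unweighted distance picks the wrong pose for both samples, while the learned weights favor the informative dimension. In a tracker, the descriptors would be HOG vectors extracted at candidate positions and scales, and the templates would be the pose-specific reference models.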