We consider the problem of real-time head tracking and pose estimation from low-resolution images. Tracking and pose recognition are treated as two coupled problems in a probabilistic framework: a template-based algorithm with multiple pose-specific reference models jointly determines the position and scale of the target and its head pose. The target representation is based on Histograms of Oriented Gradients (HOG), descriptors that are simultaneously robust to varying illumination, fast to compute, and discriminative with respect to pose. To improve pose recognition accuracy, we define the likelihood as a parameterized function and propose to learn it from training data with a new discriminative approach based on the large-margin paradigm. The performance of the learning algorithm and of the tracker is evaluated on public image and video databases.
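The joint search over position, scale, and pose described above can be illustrated with a minimal sketch. It is not the implementation evaluated in the paper: the descriptor is a simplified HOG (unsigned orientation histograms on a fixed cell grid, without block normalization), and the likelihood is an assumed exponential of the squared distance to each pose-specific template with a hand-set parameter `lam`, whereas the abstract proposes learning the likelihood parameters with a large-margin method. The names `hog_descriptor`, `joint_estimate`, `pose_a`, and `pose_b` are hypothetical.

```python
import numpy as np

def hog_descriptor(patch, cells=2, bins=8):
    """Simplified HOG: per-cell histograms of unsigned gradient orientation."""
    patch = patch.astype(float)
    gy, gx = np.gradient(patch)                      # derivatives along rows, columns
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)          # unsigned orientation in [0, pi)
    bin_idx = np.minimum((ang / np.pi * bins).astype(int), bins - 1)
    h, w = patch.shape
    feats = []
    for i in range(cells):
        for j in range(cells):
            rows = slice(i * h // cells, (i + 1) * h // cells)
            cols = slice(j * w // cells, (j + 1) * w // cells)
            # Magnitude-weighted orientation histogram for this cell.
            hist = np.bincount(bin_idx[rows, cols].ravel(),
                               weights=mag[rows, cols].ravel(),
                               minlength=bins)
            feats.append(hist)
    v = np.concatenate(feats)
    return v / (np.linalg.norm(v) + 1e-9)            # L2 normalization

def joint_estimate(image, templates, candidates, lam=5.0):
    """Score every (x, y, size) candidate against every pose template.

    Returns (likelihood, (x, y, size), pose) for the best joint hypothesis.
    The exponential likelihood and the value of lam are assumptions.
    """
    best = None
    for (x, y, size) in candidates:
        patch = image[y:y + size, x:x + size]
        if patch.shape != (size, size):              # skip windows leaving the image
            continue
        d = hog_descriptor(patch)
        for pose, ref in templates.items():
            lik = np.exp(-lam * np.sum((d - ref) ** 2))
            if best is None or lik > best[0]:
                best = (lik, (x, y, size), pose)
    return best

# Synthetic example: two "poses" that differ only in gradient direction.
ramp = np.tile(np.arange(16.0), (16, 1))             # intensity increases left to right
templates = {"pose_a": hog_descriptor(ramp), "pose_b": hog_descriptor(ramp.T)}
image = np.zeros((48, 48))
image[12:28, 10:26] = ramp.T                         # "pose_b" target at x=10, y=12
candidates = [(x, y, 16) for x in range(6, 15, 2) for y in range(8, 17, 2)]
likelihood, state, pose = joint_estimate(image, templates, candidates)
```

Because each hypothesis couples a window with a pose label, the maximization selects location, scale, and pose together instead of tracking first and classifying afterwards. A real tracker would replace the exhaustive candidate list with a probabilistic filter over time and the fixed `lam` with learned parameters.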