Differential video coding of face and gesture events in presentation videos

Authors:
Robin Tan;James W. Davis
Affiliations:
Computer Vision Laboratory, Department of Computer Science and Engineering, Ohio State University;Computer Vision Laboratory, Department of Computer Science and Engineering, Ohio State University
Venue:
Computer Vision and Image Understanding - Special issue on event detection in video
Year:
2004

Citing 19
Cited 5

Pfinder: Real-Time Tracking of the Human Body

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multimedia communications protocols and applications

Multimedia communications protocols and applications
Neural Network-Based Face Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Example-Based Learning for View-Based Human Face Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Frontal-view face detection and facial feature extraction using color, shape and symmetry based cost functions

Pattern Recognition Letters
Automatic detection of 'Goal' segments in basketball videos

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Scene context dependent rate control

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
A new foreground extraction scheme for video streams

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Assessing face and speech consistency for monologue detection in video

Proceedings of the tenth ACM international conference on Multimedia
Finding Naked People

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Are Listeners Paying Attention to the Hand Gestures of an Anthropomorphic Agent? An Evaluation Using a Gaze Tracking Method

Proceedings of the International Gesture Workshop on Gesture and Sign Language in Human-Computer Interaction
Color-Based Hands Tracking System for Sign Language Recognition

FG '98 Proceedings of the 3rd. International Conference on Face & Gesture Recognition
Human activity detection in MPEG sequences

HUMO '00 Proceedings of the Workshop on Human Motion (HUMO'00)
Comparison of Five Color Models in Skin Pixel Classification

RATFG-RTS '99 Proceedings of the International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems
Gesture Cues for Conversational Interaction in Monocular Video

RATFG-RTS '99 Proceedings of the International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems
Real Time Face and Object Tracking as a Component of a Perceptual User Interface

WACV '98 Proceedings of the 4th IEEE Workshop on Applications of Computer Vision (WACV'98)
Indexing Colored Surfaces in Images

ICPR '96 Proceedings of the International Conference on Pattern Recognition (ICPR '96) Volume III-Volume 7276 - Volume 7276
A perceptual user interface for recognizing head gesture acknowledgements

Proceedings of the 2001 workshop on Perceptive user interfaces
Improving continuous gesture recognition with spoken prosody

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition

Gesture modeling and animation based on a probabilistic re-creation of speaker style

ACM Transactions on Graphics (TOG)
Hand gesture recognition using a neural network shape fitting technique

Engineering Applications of Artificial Intelligence
Augmented reality as means for creating shared understanding

European Conference on Cognitive Ergonomics: Designing beyond the Product --- Understanding Activity and User Experience in Ubiquitous Environments
Statistical classification of skin color pixels from MPEG videos

ACIVS'07 Proceedings of the 9th international conference on Advanced concepts for intelligent vision systems
Using a game controller for relaying deictic gestures in computer-mediated communication

International Journal of Human-Computer Studies

Quantified Score

Hi-index	0.00

Visualization

Abstract

Currently, bandwidth limitations pose a major challenge for delivering high-quality multimedia information over the Internet to users. In this research, we aim to provide a better compression of presentation videos (e.g., lectures). The approach is based on the idea that people tend to pay more attention to the face and gesturing hands, and therefore these regions are given more resolution than the remaining image. Our method first detects and tracks the face and hand regions using color-based segmentation and Kalman filtering. Next, different classes of natural hand gesture are recognized from the hand trajectories by identifying gesture holds, position/velocity changes, and repetitive movements. The detected face/ hand regions and gesture events in the video are then encoded at higher resolution than the remaining lower-resolution background. We present results of the tracking and gesture recognition approach, and evaluate and compare videos compressed with the proposed method to uniform compression.