A highly efficient system for automatic face region detection in MPEG video

Authors:
Hualu Wang;Shih-Fu Chang
Affiliations:
Dept. of Electr. Eng., Columbia Univ., New York, NY;-
Venue:
IEEE Transactions on Circuits and Systems for Video Technology
Year:
1997

Citing 0
Cited 50

Searching and editing MPEG-compressed video in a distributed online environment

Multimedia Systems
On face detection in the compressed domain

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Panoramic video capturing and compressed domain virtual camera control

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Detecting Faces in Images: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence
Stratification Approach to Modeling Video

Multimedia Tools and Applications
Dialogue Scenes Detection in MPEG Movies: A Multi-expert Approach

MDIC '01 Proceedings of the Second International Workshop on Multimedia Databases and Image Communication
Neural Networks Retraining for Unsupervised Video Object Segmentation of Videoconference Sequences

ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
Efficient Face Extraction Using Skin-Color Model and a Neural Network

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Face Detection by Integrating Multiresolution-Based Watersheds and a Skin-Color Model

IEA/AIE '02 Proceedings of the 15th international conference on Industrial and engineering applications of artificial intelligence and expert systems: developments in applied artificial intelligence
Efficient and Automatic Faces Detection Based on Skin-Tone and Neural Network Model

IEA/AIE '02 Proceedings of the 15th international conference on Industrial and engineering applications of artificial intelligence and expert systems: developments in applied artificial intelligence
Automatic Closed Caption Detection and Font Size Differentiation in MPEG Video

VISUAL '02 Proceedings of the 5th International Conference on Recent Advances in Visual Information Systems
Motion Activity Based Shot Identification and Closed Caption Detection for Video Structuring

VISUAL '02 Proceedings of the 5th International Conference on Recent Advances in Visual Information Systems
A Multi-expert System for Movie Segmentation

MCS '02 Proceedings of the Third International Workshop on Multiple Classifier Systems
A Fast Anchor Shot Detection Algorithm on Compressed Video

PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Face Detection for Video Summaries

CIVR '02 Proceedings of the International Conference on Image and Video Retrieval
Scene Segmentation and Image Feature Extraction for Video Indexing and Retrieval

VISUAL '99 Proceedings of the Third International Conference on Visual Information and Information Systems
Face Detection and Its Applications in Intelligent and Focused Image Retrieval

ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Security of human video objects by incorporating a chaos-based feedback cryptographic scheme

Proceedings of the 12th annual ACM international conference on Multimedia
Real-time foveation techniques for low bit rate video coding

Real-Time Imaging
Skin Segmentation Using Color Pixel Classification: Analysis and Comparison

IEEE Transactions on Pattern Analysis and Machine Intelligence
Recent advances in visual and infrared face recognition: a review

Computer Vision and Image Understanding
A Survey of MPEG-1 Audio, Video and Semantic Analysis Techniques

Multimedia Tools and Applications
WebGuard: A Web Filtering Engine Combining Textual, Structural, and Visual Content-Based Analysis

IEEE Transactions on Knowledge and Data Engineering
Role of edge detection in video semantics

VIP '02 Selected papers from the 2002 Pan-Sydney workshop on Visualisation - Volume 22
Edge-based semantic classification of sports video sequences

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
An algorithm to estimate mean vehicle speed from MPEG Skycam video

Multimedia Tools and Applications
VAST MM: multimedia browser for presentation video

Proceedings of the 6th ACM international conference on Image and video retrieval
MPEG-2 compressed-domain algorithms for video analysis

EURASIP Journal on Applied Signal Processing
Lightweight object tracking in compressed video streams demonstrated in region-of-interest coding

EURASIP Journal on Applied Signal Processing
A committee machine scheme for feature map fusion under uncertainty: the face detection case

International Journal of Intelligent Systems Technologies and Applications
An automatic human video objects encryption scheme built on stream and block ciphers and based on chaos

ICS'05 Proceedings of the 9th WSEAS International Conference on Systems
Saliency model-based face segmentation and tracking in head-and-shoulder video sequences

Journal of Visual Communication and Image Representation
SEGMENTATION OF MULTIPLE HUMAN OBJECTS IN VIDEO SEQUENCES

Applied Artificial Intelligence
Exploiting Voronoi diagram properties in face segmentation and feature extraction

Pattern Recognition
Face detection and recognition of natural human emotion using Markov random fields

Personal and Ubiquitous Computing
A Decision Tree Approach for Scene Pattern Recognition and Extraction in Snooker Videos

ICIAR '09 Proceedings of the 6th International Conference on Image Analysis and Recognition
On the performance of kernel methods for skin color segmentation

EURASIP Journal on Advances in Signal Processing
Robust Human Face Detection for Moving Pictures Based on Cascade-Typed Hybrid Classifier

ICIC '07 Proceedings of the 3rd International Conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence
Statistical classification of skin color pixels from MPEG videos

ACIVS'07 Proceedings of the 9th international conference on Advanced concepts for intelligent vision systems
A semantic framework for video genre classification and event analysis

Image Communication
Hand localization and fingers features extraction: application to digit recognition in sign language

IDEAL'09 Proceedings of the 10th international conference on Intelligent data engineering and automated learning
Face detection directly from H.264 compressed video with convolutional neural network

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Honeycomb model based skin colour detector for face detection

International Journal of Computer Applications in Technology
Actor based video indexing and retrieval using visual information

ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part II
Vision-Based recognition of hand shapes in taiwanese sign language

ACII'05 Proceedings of the First international conference on Affective Computing and Intelligent Interaction
Background segmentation beyond RGB

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part II
Face detection in resource constrained wireless systems

Mobile Multimedia Processing
A cascade face recognition system using hybrid feature extraction

Digital Signal Processing
Behavior recognition from video based on human constrained descriptor and adaptable neural networks

Proceedings of the 4th ACM/IEEE international workshop on Analysis and retrieval of tracked events and motion in imagery stream
Detection of human faces in a compressed domain for video stratification

The Visual Computer: International Journal of Computer Graphics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Human faces provide a useful cue in indexing video content. We present a highly efficient system that can rapidly detect human face regions in MPEG video sequences. The underlying algorithm takes the inverse quantized discrete cosine transform (DCT) coefficients of MPEG video as the input, and outputs the locations of the detected face regions. The algorithm consists of three stages, where chrominance, shape, and frequency information are used, respectively. By detecting faces directly in the compressed domain, there is no need to carry out the inverse DCT transform, so that the algorithm can run faster than the real time. In our experiments, the algorithm detected 85-92% of the faces in three test sets, including both intraframe and interframe coded image frames from news video. The average run time ranges from 13-33 ms per frame. The algorithm can be applied to JPEG unconstrained images or motion JPEG video as well