Multi-modal interview concept detection for rushes exploitation

Authors:
Anan Liu;Sheng Tang;Yongdong Zhang;Jintao Li;Zhaoxuan Yang
Affiliations:
Tianjin University, Tianjin, China and Chinese Academy of Sciences, Beijing, China;Chinese Academy of Sciences, Beijing, China;Chinese Academy of Sciences, Beijing, China;Chinese Academy of Sciences, Beijing, China;Tianjin University, Tianjin, China and Chinese Academy of Sciences, Beijing, China
Venue:
Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Year:
2007

References:
A Hierarchical and Multi-Model Based Algorithm for Lead Detection and News Program Narrative Parsing. AINA '05: Proceedings of the 19th International Conference on Advanced Information Networking and Applications, Volume 2.
Large-Scale Concept Ontology for Multimedia. IEEE MultiMedia.
Subspace analysis and optimization for AAM based face alignment. FGR '04: Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition.

Abstract

In the Large-Scale Concept Ontology for Multimedia (LSCOM), and under the requirements of the fourth task of TRECVID 2006 (rushes exploitation), "interview" is an important semantic concept for analyzing rushes content. This paper presents a shot-level method for detecting the "interview" concept. Face detection and audio classification are applied to each shot to detect the "face" and "speech" concepts, respectively. The audio and visual results are then integrated to decide whether the shot shows an interview. The method is intended to support video editing. Large-scale experiments demonstrate the accuracy and effectiveness of the proposed method.
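The shot-level fusion described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the "face" and "speech" results are combined, so the rule below (a shot counts as an interview when both concepts are present) is an assumption, not the authors' method. The `Shot` fields, the function names, and the 0.5 thresholds are hypothetical; the per-shot ratios stand in for the outputs of a face detector and an audio classifier, which are not implemented here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Shot:
    shot_id: int
    # Hypothetical inputs: fraction of sampled frames in which a face was
    # detected, and fraction of audio windows classified as speech.
    face_ratio: float
    speech_ratio: float


def is_interview(shot: Shot, face_thresh: float = 0.5,
                 speech_thresh: float = 0.5) -> bool:
    """Assumed fusion rule: 'interview' = 'face' AND 'speech'."""
    has_face = shot.face_ratio >= face_thresh
    has_speech = shot.speech_ratio >= speech_thresh
    return has_face and has_speech


def interview_shots(shots: List[Shot], face_thresh: float = 0.5,
                    speech_thresh: float = 0.5) -> List[int]:
    """Return the ids of shots labeled 'interview', in input order."""
    return [s.shot_id for s in shots
            if is_interview(s, face_thresh, speech_thresh)]


if __name__ == "__main__":
    example = [
        Shot(0, face_ratio=0.9, speech_ratio=0.8),  # face and speech
        Shot(1, face_ratio=0.9, speech_ratio=0.1),  # face only
        Shot(2, face_ratio=0.0, speech_ratio=0.9),  # speech only
    ]
    print(interview_shots(example))
```

A conjunction is the simplest possible fusion; a weighted score or a learned classifier over the two modalities would fit the same interface.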