Naming every individual in news video monologues

Authors:
Jun Yang;Alexander G. Hauptmann
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the 12th annual ACM international conference on Multimedia
Year:
2004

Citing 13
Cited 14

Automatic parsing and indexing of news video

Multimedia Systems
Informedia: news-on-demand multimedia information acquisition and retrieval

Intelligent multimedia information retrieval
The LIMSI Broadcast News transcription system

Speech Communication - Special issue on automatic transcription of broadcast news data
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Named Faces: Putting Names to Faces

IEEE Intelligent Systems
Video OCR: indexing digital new libraries by recognition of superimposed captions

Multimedia Systems - Special section on video libraries
Name-It: Association of Face and Name in Video

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Face recognition: A literature survey

ACM Computing Surveys (CSUR)
Automated annotation of human faces in family albums

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Object Detection Using the Statistics of Parts

International Journal of Computer Vision
Nymble: a high-performance learning name-finder

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
A discriminative learning framework with pairwise constraints for video object classification

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Names and faces in the news

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Multiple instance learning for labeling faces in broadcasting news video

Proceedings of the 13th annual ACM international conference on Multimedia
Naming faces in broadcast news video by image google

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Naming faces in films using hypergraph matching

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Character identification in feature-length films using global face-name matching

IEEE Transactions on Multimedia
News video retrieval by learning multimodal semantic information

VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Identifying persons in news article images based on textual analysis

ICADL'10 Proceedings of the role of digital libraries in a time of global change, and 12th international conference on Asia-Pacific digital libraries
Weakly supervised person naming in news video

RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Mining weakly labeled web facial images for search-based face annotation

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Retrieval-based face annotation by weak label regularized local coordinate coding

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Annotating news video with locations

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Face retrieval in broadcasting news video by fusing temporal and intensity information

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Unsupervised face-name association via commute distance

Proceedings of the 20th ACM international conference on Multimedia
Community as a connector: associating faces with celebrity names in web videos

Proceedings of the 20th ACM international conference on Multimedia
Naming persons in video: Using the weak supervision of textual stories

Journal of Visual Communication and Image Representation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Naming every individual person appearing in broadcast news videos with names detected from the video transcript leads to better access of the news video content. In this paper, we approach this challenging problem with a statistical learning method. Two categories of information extracted from multiple video modalities have been explored, namely features, which help distinguish the true name of every person, as well as constraints, which reveal the relationships among the names of different persons. The person-naming problem is formulated into a learning framework which predicts the most likely name for each person based on the features, and refines the predictions using the constraints. Experiments conducted on ABC World New Tonight and CNN Headline News videos demonstrate that this approach outperforms a non-learning alternative by a large amount.