Automatic parsing and indexing of news video
Multimedia Systems
Informedia: news-on-demand multimedia information acquisition and retrieval
Intelligent multimedia information retrieval
The LIMSI Broadcast News transcription system
Speech Communication - Special issue on automatic transcription of broadcast news data
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
Named Faces: Putting Names to Faces
IEEE Intelligent Systems
Video OCR: indexing digital new libraries by recognition of superimposed captions
Multimedia Systems - Special section on video libraries
Name-It: Association of Face and Name in Video
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Face recognition: A literature survey
ACM Computing Surveys (CSUR)
Automated annotation of human faces in family albums
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Object Detection Using the Statistics of Parts
International Journal of Computer Vision
Nymble: a high-performance learning name-finder
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
A discriminative learning framework with pairwise constraints for video object classification
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Multiple instance learning for labeling faces in broadcasting news video
Proceedings of the 13th annual ACM international conference on Multimedia
Naming faces in broadcast news video by image google
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Naming faces in films using hypergraph matching
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Character identification in feature-length films using global face-name matching
IEEE Transactions on Multimedia
News video retrieval by learning multimodal semantic information
VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Identifying persons in news article images based on textual analysis
ICADL'10 Proceedings of the role of digital libraries in a time of global change, and 12th international conference on Asia-Pacific digital libraries
Weakly supervised person naming in news video
RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Mining weakly labeled web facial images for search-based face annotation
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Retrieval-based face annotation by weak label regularized local coordinate coding
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Annotating news video with locations
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Face retrieval in broadcasting news video by fusing temporal and intensity information
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Unsupervised face-name association via commute distance
Proceedings of the 20th ACM international conference on Multimedia
Community as a connector: associating faces with celebrity names in web videos
Proceedings of the 20th ACM international conference on Multimedia
Naming persons in video: Using the weak supervision of textual stories
Journal of Visual Communication and Image Representation
Hi-index | 0.00 |
Naming every individual person appearing in broadcast news videos with names detected from the video transcript leads to better access of the news video content. In this paper, we approach this challenging problem with a statistical learning method. Two categories of information extracted from multiple video modalities have been explored, namely features, which help distinguish the true name of every person, as well as constraints, which reveal the relationships among the names of different persons. The person-naming problem is formulated into a learning framework which predicts the most likely name for each person based on the features, and refines the predictions using the constraints. Experiments conducted on ABC World New Tonight and CNN Headline News videos demonstrate that this approach outperforms a non-learning alternative by a large amount.