Measuring novelty and redundancy with multiple modalities in cross-lingual broadcast news

Authors:
Xiao Wu;Alexander G. Hauptmann;Chong-Wah Ngo
Affiliations:
School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, USA and Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong ...;School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, USA;Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong
Venue:
Computer Vision and Image Understanding
Year:
2008

Citing 24
Cited 4

Copy detection mechanisms for digital documents

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Topic Detection and Tracking: Event-Based Information Organization

Topic Detection and Tracking: Event-Based Information Organization
The LIMSI Broadcast News transcription system

Speech Communication - Special issue on automatic transcription of broadcast news data
Novelty and redundancy detection in adaptive filtering

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Topic-conditioned novelty detection

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Query by video clip

Multimedia Systems - Special section on video libraries
Retrieval and novelty detection at the sentence level

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A System for new event detection

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Hierarchical video content description and summarization using unified semantic and visual similarity

Multimedia Systems
Newsjunkie: providing personalized newsfeeds via analysis of information novelty

Proceedings of the 13th international conference on World Wide Web
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Language-specific models in multilingual topic tracking

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Story boundary detection in large broadcast news video archives: techniques, experience and trends

Proceedings of the 12th annual ACM international conference on Multimedia
Towards auto-documentary: tracking the evolution of news stories

Proceedings of the 12th annual ACM international conference on Multimedia
An efficient parts-based near-duplicate and sub-image retrieval system

Proceedings of the 12th annual ACM international conference on Multimedia
Detecting image near-duplicate by stochastic attributed relational graph matching with learning

Proceedings of the 12th annual ACM international conference on Multimedia
Similarity measures for tracking information flow

Proceedings of the 14th ACM international conference on Information and knowledge management
Novelty detection based on sentence level patterns

Proceedings of the 14th ACM international conference on Information and knowledge management
Tracking news stories across different sources

Proceedings of the 13th annual ACM international conference on Multimedia
Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Near-duplicate keyframe retrieval with visual keywords and semantic context

Proceedings of the 6th ACM international conference on Image and video retrieval
Efficient video similarity measurement with video signature

IEEE Transactions on Circuits and Systems for Video Technology
Clip-based similarity measure for query-dependent clip retrieval and video summarization

IEEE Transactions on Circuits and Systems for Video Technology

Scene duplicate detection from videos based on trajectories of feature points

Proceedings of the international workshop on Workshop on multimedia information retrieval
Guest Editorial: Similarity Matching in Computer Vision and Multimedia

Computer Vision and Image Understanding
Concept-Based Video Retrieval

Foundations and Trends in Information Retrieval
Video copy detection using multiple visual cues and MPEG-7 descriptors

Journal of Visual Communication and Image Representation

Quantified Score

Hi-index	0.00

Visualization

Abstract

News videos from different channels, languages are broadcast everyday, which provide abundant information for users. To effectively search, retrieve, browse and track news stories, news story similarity plays a critical role in assessing the novelty and redundancy among news stories. In this paper, we explore different measures of novelty and redundancy detection for cross-lingual news stories. A news story is represented by multimodal features which include a sequence of keyframes in the visual track, and a set of words and named entities extracted from speech transcript in the audio track. Vector space models and language models on individual features (text, named entities and keyframes) are constructed to compare the similarity among stories. Furthermore, multiple modalities are further fused to improve the performance. Experiments on the TRECVID-2005 cross-lingual news video corpus showed that modalities and measures demonstrate variant performance for novelty and redundancy detection. Language models on text are appropriate for detecting completely redundant stories, while Cosine Distance on keyframes is suitable for detecting somewhat redundant stories. The performance on mono-lingual topics is better than multilingual topics. Textual features and visual features complement each other, and fusion of text, named entities and keyframes substantially improves the performance, which outperforms approaches with just individual features.