A lexicon-guided LSI method for semantic news video retrieval

Authors:
Juan Cao;Sheng Tang;Jintao Li;Yongdong Zhang;Xuefeng Pan
Affiliations:
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and Graduate University of the Chinese Academy of Sciences, Bei ...;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and Graduate University of the Chinese Academy of Sciences, Bei ...
Venue:
PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
Year:
2007

Citing 3
Cited 0

SVDPACK: A Fortran-77 Software Library for the Sparse Singular Value Decomposition

SVDPACK: A Fortran-77 Software Library for the Sparse Singular Value Decomposition
A novel method for spoken text feature extraction in semantic video retrieval

PCM'06 Proceedings of the 7th Pacific Rim conference on Advances in Multimedia Information Processing
Video retrieval using high level features: exploiting query matching and confidence-based weighting

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many researchers try to utilize the semantic information extracted from visual feature to directly realize the semantic video retrieval or to supplement the automated speech recognition (ASR) text retrieval. But bridging the gap between the low-level visual feature and semantic content is still a challenging task. In this paper, we study how to effectively use Latent Semantic Indexing (LSI) to improve the semantic video retrieval through the ASR texts. The basic LSI method has been shown effective in the traditional text retrieval and the noisy ASR text retrieval. In this paper, we further use the lexiconguided semantic clustering to effectively remove the noise introduced by news video's additional contents, and use the cluster-based LSI to automatically mine the semantic structure underlying the terms expression. Tests on the TRECVID 2005 dataset show that the above two enhancements achieve 21.3% and 6.9% improvements in performance over the traditional vector-space model(VSM) and the basic LSI separately.