Latent semantic analysis for an effective region-based video shot retrieval system

  • Authors:
  • Fabrice Souvannavong;Bernard Merialdo;Benoît Huet

  • Affiliations:
  • Institut Eurécom, Sophia-Antipolis - France;Institut Eurécom, Sophia-Antipolis - France;Institut Eurécom, Sophia-Antipolis - France

  • Venue:
  • Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
  • Year:
  • 2004

Quantified Score

Hi-index 0.01

Visualization

Abstract

We present a complete and efficient framework for video shot indexing and retrieval. Video shots are described by their key-frame, themselves described by their regions. Region-based approaches suffer from the complexity of segmentation and comparison tasks. A compact region-based shot representation is usually obtained thanks to vector-quantization method. We thus introduce LSA to reduce the noise inherent to the segmentation and the quantization processes. Then to better capture the content of video shots, we propose two original methods. The first takes advantage of a multi-scale segmentation of frames while the second uses multiple frames to represent a shot. Both approaches require more computation time during the pre-processing but not for indexing and comparison tasks. Indeed the extra information is included in the original signatures of shots. Finally we introduce a relevance feedback loop to optimize the search and propose a new method to optimize the effect of LSA. In the experimental section, we make an evaluation of latent semantic analysis and proposed approaches on two problems, namely object retrieval and semantic content estimation