Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News

  • Authors:
  • A. Hauptmann;Rong Yan;Wei-Hao Lin;M. Christel;H. Wactlar

  • Affiliations:
  • Carnegie Mellon Univ., Pittsburgh;-;-;-;-

  • Venue:
  • IEEE Transactions on Multimedia
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

A number of researchers have been building high-level semantic concept detectors such as outdoors, face, building, to help with semantic video retrieval. Our goal is to examine how many concepts would be needed, and how they should be selected and used. Simulating performance of video retrieval under different assumptions of concept detection accuracy, we find that good retrieval can be achieved even when detection accuracy is low, if sufficiently many concepts are combined. We also derive suggestions regarding the types of concepts that would be most helpful for a large concept lexicon. Since our user study finds that people cannot predict which concepts will help their query, we also suggest ways to find the best concepts to use. Ultimately, this paper concludes that "concept-based" video retrieval with fewer than 5000 concepts, detected with a minimal accuracy of 10% mean average precision is likely to provide high accuracy results in broadcast news retrieval.