A Multi-Pronged Approach to Improving Semantic Extraction of News Video

  • Authors:
  • A. G. Hauptmann;M. -Y. Chen;M. Christel;W. -H. Lin;J. Yang

  • Affiliations:
  • School of Computer Science, Carnegie Mellon University, Pittsburgh, USA;School of Computer Science, Carnegie Mellon University, Pittsburgh, USA;School of Computer Science, Carnegie Mellon University, Pittsburgh, USA;School of Computer Science, Carnegie Mellon University, Pittsburgh, USA;School of Computer Science, Carnegie Mellon University, Pittsburgh, USA

  • Venue:
  • Journal of Signal Processing Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe a multi-strategy approach to improving semantic extraction from news video. Experiments show the value of careful parameter tuning, exploiting multiple feature sets and multilingual linguistic resources, applying text retrieval approaches for image features, and establishing synergy between multiple concepts through undirected graphical models. We present a discriminative learning framework called Multi-concept Discriminative Random Field (MDRF) for building probabilistic models of video semantic concept detectors by incorporating related concepts as well as the low-level observations. The model exploits the power of discriminative graphical models to simultaneously capture the associations of concept with observed data and the interactions between related concepts. Compared with previous methods, this model not only captures the co-occurrence between concepts but also incorporates the raw data observations into a unified framework. We also describe an approximate parameter estimation algorithm and present results obtained from the TRECVID 2006 data. No single approach, however, provides a consistently better result for all concept detection tasks, which suggests that extracting video semantics should exploit multiple resources and techniques rather than naively relying on a single approach