A reduced yet extensible audio-visual description language

  • Authors:
  • Raphaël Troncy;Jean Carrive

  • Affiliations:
  • Institut National de l'Audiovisuel, Bry-sur-Marne, France;Institut National de l'Audiovisuel, Bry-sur-Marne, France

  • Venue:
  • Proceedings of the 2004 ACM symposium on Document engineering
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Enabling an intelligent access to multimedia data requires a powerful description language. In this paper we demonstrate why the MPEG-7 standard fails to fulfill this task. We introduce then our proposition: an audio-visual specific description language modular reduced but designed to be extensible. This language is centered on the notions of descriptor and structure with a well-defined semantics. A descriptor can be a low-level feature automatically extracted from the signal or a higher semantic concept that will be used to annotate the video documents. The descriptors can be combined into structures according to defined models that provide description patterns.