Multimedia edges: finding hierarchy in all dimensions

  • Authors:
  • Malcolm Slaney; Dulce Ponceleon; James Kaufman

  • Affiliations:
  • IBM Almaden Research Center, San Jose, CA; IBM Almaden Research Center, San Jose, CA; IBM Almaden Research Center, San Jose, CA

  • Venue:
  • MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
  • Year:
  • 2001

Abstract

This paper describes a new unified representation for the information in a video. We reduce the dimensionality of the signal with either a singular-value decomposition (on the semantic and image data) or mel-frequency cepstral coefficients (on the audio data) and then concatenate the vectors to form a multi-dimensional representation of the video. Using scale-space techniques we find large jumps in the video's path, which we call edges. We use these techniques to analyze the temporal properties of the audio and image data in a video. This analysis creates a hierarchical segmentation of the video, or a table-of-contents, from the audio, semantic and image data.
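
The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the audio matrix is only a stand-in for mel-frequency cepstral coefficients, and every name here (`reduce_svd`, `edge_strength`, `find_edges`, the 0.5 relative threshold, the scales 2 and 16) is an assumption chosen for illustration. Each modality is reduced with an SVD, the vectors are concatenated into one path, and the path is differentiated with Gaussian-derivative filters at two scales; frames where the derivative norm peaks are reported as edges.

```python
import numpy as np


def reduce_svd(features, k):
    """Project a (frames x dims) matrix onto its top-k singular directions."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:k].T


def gaussian_derivative_kernel(sigma):
    """Sampled first derivative of a unit-area Gaussian of width sigma."""
    radius = int(np.ceil(4 * sigma))
    t = np.arange(-radius, radius + 1, dtype=float)
    g = np.exp(-t**2 / (2 * sigma**2))
    g /= g.sum()
    return -t / sigma**2 * g


def edge_strength(path, sigma):
    """Norm of the Gaussian-smoothed temporal derivative of a (frames x dims) path."""
    kernel = gaussian_derivative_kernel(sigma)
    pad = len(kernel) // 2
    # Repeat the first and last frames so the output has one value per frame.
    padded = np.pad(path, ((pad, pad), (0, 0)), mode="edge")
    deriv = np.stack(
        [np.convolve(padded[:, d], kernel, mode="valid") for d in range(path.shape[1])],
        axis=1,
    )
    return np.linalg.norm(deriv, axis=1)


def find_edges(strength, rel_threshold=0.5):
    """Frames that are local maxima above rel_threshold times the global maximum."""
    s = strength
    threshold = rel_threshold * s.max()
    is_peak = (s[1:-1] > s[:-2]) & (s[1:-1] >= s[2:]) & (s[1:-1] > threshold)
    return np.flatnonzero(is_peak) + 1


# Synthetic stand-ins for the three modalities (hypothetical data, not from the paper).
rng = np.random.default_rng(0)
n_frames = 240


def synthetic_modality(dims):
    """Noisy features with a lasting change at frame 120 and a short excursion over frames 180-185."""
    x = 0.1 * rng.standard_normal((n_frames, dims))
    x[120:] += rng.standard_normal(dims)
    x[180:186] += rng.standard_normal(dims)
    return x


semantic = reduce_svd(synthetic_modality(20), k=5)
image = reduce_svd(synthetic_modality(30), k=5)
audio = synthetic_modality(13)  # placeholder for 13 MFCCs per frame
path = np.hstack([semantic, image, audio])  # one point per frame in a 23-dimensional space

edges_by_scale = {
    sigma: find_edges(edge_strength(path, sigma)) for sigma in (2.0, 16.0)
}
for sigma, edges in edges_by_scale.items():
    print(f"sigma={sigma}: edges at frames {edges.tolist()}")
```

In this sketch the fine scale responds to both the lasting change and the short excursion, while the coarse scale keeps only the lasting change. Nesting the fine-scale edges under the coarse-scale ones is one simple way to read off a hierarchy of the kind the abstract calls a table-of-contents.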