Multimedia edges: finding hierarchy in all dimensions

Authors:
Malcolm Slaney;Dulce Ponceleon;James Kaufman
Affiliations:
IBM Almaden Research Center, San Jose, CA;IBM Almaden Research Center, San Jose, CA;IBM Almaden Research Center, San Jose, CA
Venue:
MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Year:
2001

Citing 9
Cited 5

Uniqueness of the Gaussian Kernel for Scale-Space Filtering

IEEE Transactions on Pattern Analysis and Machine Intelligence
TOPIC ISLANDS—a wavelet-based text visualization system

Proceedings of the conference on Visualization '98
Statistical Models for Text Segmentation

Machine Learning - Special issue on natural language learning
Visualizing music and audio using self-similarity

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Clustering by Scale-Space Filtering

IEEE Transactions on Pattern Analysis and Machine Intelligence
Advances in domain independent linear text segmentation

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Multi-paragraph segmentation of expository text

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Hierarchical segmentation using latent semantic indexing in scale space

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
"What is in that video anyway?": In Search of Better Browsing

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2

Temporal event clustering for digital photo collections

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Video segmentation combining similarity analysis and classification

Proceedings of the 12th annual ACM international conference on Multimedia
Temporal event clustering for digital photo collections

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Towards a Smarter Meeting Record--Capture and Access of Meetings Revisited

Multimedia Tools and Applications
vADeo: video advertising system

Proceedings of the 15th international conference on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a new unified representation for the informa驴tion in a video. We reduce the dimensionality of the signal with either a singular-value decomposition (on the semantic and image data) or mel-frequency cepstral coefficients (on the audio data) and then concatenate the vectors to form a multi-dimensional represen驴tation of the video. Using scale-space techniques we find large jumps in the video's path, which we call edges. We use these tech驴niques to analyze the temporal properties of the audio and image data in a video. This analysis creates a hierarchical segmentation of the video, or a table-of-contents, from the audio, semantic and image data.