Semantic Video Retrieval Using Audio Analysis

Authors:
Erwin M. Bakker;Michael S. Lew
Affiliations:
-;-
Venue:
CIVR '02 Proceedings of the International Conference on Image and Video Retrieval
Year:
2002

Citing 5
Cited 6

An overview of audio information retrieval

Multimedia Systems - Special issue on audio and multimedia
Towards robust features for classifying audio in the CueVideo system

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
A robust audio classification and segmentation method

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Content-Based Classification, Search, and Retrieval of Audio

IEEE MultiMedia
Structured Coding for Content Based Interactive Audio

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2

Challenges of Image and Video Retrieval

CIVR '02 Proceedings of the International Conference on Image and Video Retrieval
Fast video segment retrieval by sort-merge feature selection, boundary refinement, and lazy evaluation

Computer Vision and Image Understanding - Special isssue on video retrieval and summarization
Content-based multimedia information retrieval: State of the art and challenges

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Acoustic environment classification

ACM Transactions on Speech and Language Processing (TSLP)
Interactive multimedia system for distance learning of higher education

Edutainment'06 Proceedings of the First international conference on Technologies for E-Learning and Digital Entertainment
Human interaction categorization by using audio-visual cues

Machine Vision and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Semantic understanding of video is an important frontier in content based retrieval. In the research literature, significant attention has been given to the visual aspect of video, however, relatively little work directly uses audio content for video retrieval. Our paper gives an overview of our current research directions in semantic video retrieval using audio content. We discuss the effectiveness of classifying audio into semantic categories by combining both global and local audio features based in the frequency spectrum. Furthermore, we introduce two novel features called Frequency Spectrum Differentials (FSD), and Differential Swap Rate (DSR), that both model the shape of the spectrum.