Heuristic approach for generic audio data segmentation and annotation

Authors:
Tong Zhang;C.-C. Jay Kuo
Affiliations:
Integrated Media Systems Center and Department of Electrical Engineering-Systems, University of Southern California, Los Angeles, CA;Integrated Media Systems Center and Department of Electrical Engineering-Systems, University of Southern California, Los Angeles, CA
Venue:
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Year:
1999

Citing 4
Cited 13

Query by humming: musical information retrieval in an audio database

Proceedings of the third ACM international conference on Multimedia
Content-Based Classification, Search, and Retrieval of Audio

IEEE MultiMedia
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Real-time discrimination of broadcast speech/music

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02

Determining computable scenes in films and their structures using audio-visual memory models

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Automatically extracting highlights for TV Baseball programs

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Pause concepts for audio segmentation at different semantic levels

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Automatic segmentation of news items based on video and audio features

Journal of Computer Science and Technology
An Object-Oriented Schema for Querying Audio

OOIS '02 Proceedings of the 8th International Conference on Object-Oriented. Information Systems
Automatic Segmentation of News Items Based on Video and Audio Features

PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Structuring and Querying Documents in an Audio Database Management System

Multimedia Tools and Applications
Movie story intensity representation through audiovisual tempo analysis

Multimedia Tools and Applications
A two level strategy for audio segmentation

Digital Signal Processing
Effective TV advertising block division into single commercials method

KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part I
MUSIZ: a generic framework for music resizing with stretching and cropping

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Spoken Content Retrieval: A Survey of Techniques and Technologies

Foundations and Trends in Information Retrieval
Mining movies for song sequences with video based music genre identification system

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

A real-time audio segmentation and indexing scheme is presented in this paper. Audio recordings are segmented and classified into basic audio types such as silence, speech, music, song, environmental sound, speech with the music background, environmental sound with the music background, etc. Simple audio features such as the energy function, the average zero-crossing rate, the fundamental frequency, and the spectral peak track are adopted in this system to ensure on-line processing. Morphological and statistical analysis for temporal curves of these features are performed to show differences among different types of audio. A heuristic rule-based procedure is then developed to segment and classify audio signals by using these features. The proposed approach is generic and model free. It can be applied to almost any content-based audio management system. It is shown that the proposed scheme achieves an accuracy rate of more than 90% for audio classification. Examples for segmentation and indexing of accompanying audio signals in movies and video programs are also provided.