Hierarchical audio content classification system using an optimal feature selection algorithm

Authors:
P. Krishnamoorthy;Sarvesh Kumar
Affiliations:
Samsung India Software Center, Noida, India;Samsung India Software Center, Noida, India
Venue:
Multimedia Tools and Applications
Year:
2011

Abstract

This paper proposes a hierarchical, time-efficient method for audio classification and presents an automatic procedure for selecting the best set of features for audio classification using the Kolmogorov-Smirnov test (KS-test). The main motivation for our study is to propose a framework for a general-genre (e.g., action, comedy, drama, documentary, musical) movie video abstraction scheme for embedded devices, based only on the audio component. Accordingly, simple audio features are extracted to ensure the feasibility of real-time processing. Five audio classes are considered in this paper: pure speech, pure music or songs, speech with background music, environmental noise, and silence. Audio classification is performed in three stages: (i) silence or environmental noise detection, (ii) speech and non-speech classification, and (iii) pure music or songs and speech with background music classification. The proposed system has been tested on various real-time audio sources extracted from movies and TV programs. Our experiments in the context of real-time processing have shown that the algorithms produce very satisfactory results.
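
The two ideas in the abstract, KS-test feature ranking and the three-stage cascade, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that features are ranked by the two-sample KS statistic between two classes, and that stage (ii) yields pure speech while stage (iii) splits the remaining non-speech audio. The function names, the data layout, and the four predicates (`is_silence`, `is_noise`, `is_speech`, `is_pure_music`) are hypothetical placeholders; the abstract does not specify the features or decision rules used at each stage.

```python
from bisect import bisect_right


def ks_statistic(a, b):
    """Two-sample KS statistic: largest gap between the two empirical CDFs."""
    a_sorted, b_sorted = sorted(a), sorted(b)
    gap = 0.0
    for x in a_sorted + b_sorted:
        cdf_a = bisect_right(a_sorted, x) / len(a_sorted)
        cdf_b = bisect_right(b_sorted, x) / len(b_sorted)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


def rank_features(class_a, class_b):
    """Order feature names by KS statistic, most class-separating first.

    class_a and class_b map a feature name to that feature's values for
    one audio class (hypothetical data layout, not taken from the paper).
    """
    scores = {name: ks_statistic(class_a[name], class_b[name]) for name in class_a}
    return sorted(scores, key=scores.get, reverse=True)


def classify_segment(segment, is_silence, is_noise, is_speech, is_pure_music):
    """Three-stage cascade; the four predicates are caller-supplied assumptions."""
    # Stage (i): silence or environmental noise detection.
    if is_silence(segment):
        return "silence"
    if is_noise(segment):
        return "environmental noise"
    # Stage (ii): speech versus non-speech.
    if is_speech(segment):
        return "pure speech"
    # Stage (iii): pure music or songs versus speech with background music.
    if is_pure_music(segment):
        return "pure music or songs"
    return "speech with background music"
```

In this sketch a segment leaves the cascade as soon as one stage decides its class, so later stages run only on the audio that still needs them. That is one plausible reading of how a hierarchical scheme keeps processing time low on embedded devices.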