This paper proposes a hierarchical, time-efficient method for audio classification and presents an automatic procedure that selects the best feature set for audio classification using the Kolmogorov-Smirnov test (KS-test). The main motivation for the study is to provide a framework for a general-genre (e.g., action, comedy, drama, documentary, musical) movie video abstraction scheme for embedded devices, based only on the audio component. Accordingly, simple audio features are extracted to ensure that real-time processing is feasible. Five audio classes are considered: pure speech, pure music or songs, speech with background music, environmental noise, and silence. Classification proceeds in three stages: (i) silence or environmental noise detection, (ii) speech versus non-speech classification, and (iii) discrimination between pure music or songs and speech with background music. The proposed system has been tested on various real-time audio sources extracted from movies and TV programs. Experiments in the context of real-time processing show that the algorithms produce very satisfactory results.
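The two ideas in the abstract, KS-test-based feature ranking and a three-stage classification cascade, can be illustrated with a minimal Python sketch. It is not the paper's implementation. The function names (`ks_statistic`, `rank_features`, `classify_segment`), the feature names, and the stage detectors passed in by the caller are all hypothetical. The routing in stages (ii) and (iii) is one plausible reading of the abstract, which does not specify which features or thresholds each stage uses.

```python
from bisect import bisect_right


def ks_statistic(a, b):
    """Two-sample KS statistic: largest gap between the two empirical CDFs."""
    a, b = sorted(a), sorted(b)
    return max(
        abs(bisect_right(a, x) / len(a) - bisect_right(b, x) / len(b))
        for x in a + b
    )


def rank_features(class_a, class_b):
    """Rank features by how well they separate two classes.

    class_a and class_b map a feature name to the list of values that
    feature takes on training segments of that class. A larger KS
    statistic means the two distributions overlap less.
    """
    scores = {name: ks_statistic(class_a[name], class_b[name]) for name in class_a}
    return sorted(scores, key=scores.get, reverse=True)


def classify_segment(segment, stage1, stage2, stage3):
    """Three-stage cascade; every stage function is supplied by the caller.

    stage1(segment) -> "silence", "environmental noise", or None
    stage2(segment) -> True if the segment is pure speech
    stage3(segment) -> True if the segment is pure music or songs
    """
    label = stage1(segment)  # stage (i): silence or environmental noise
    if label is not None:
        return label
    if stage2(segment):  # stage (ii): speech versus non-speech
        return "pure speech"
    # stage (iii): pure music or songs versus speech with background music
    return "pure music or songs" if stage3(segment) else "speech with background music"
```

A cascade of this shape lets cheap early checks (such as an energy threshold for silence) discard segments before later stages run, which fits the real-time goal stated in the abstract.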