A fast audio classification from MPEG coded data
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
A generic audio classification and segmentation approach for multimedia indexing and retrieval
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
We focus the attention on the audio scene segmentation in AAC domain for audio-based multimedia indexing and retrieval applications. In particular, a MFCC extraction method is proposed, which is adaptive to the window switch in AAC encoding process, and independent of the audio sampling frequency. We discuss the fusion method of MFCC features, which came from different window type in order to keep the balance of the frequency and temporal resolution. A series of experiments via the probability distribution of MFCC were implemented to test the effective in audio scene segmentation. The experimental results show that such approach based on compression domain can approach the performance of the system based on PCM audio, and the CPU overload decreased dramatically. It is meaningful to the real time analysis of audio content.