Audio segmentation in AAC domain for content analysis

  • Authors:
  • Rong Zhu;Haojun Ai;Ruimin Hu

  • Affiliations:
  • National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China;National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China;National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China

  • Venue:
  • WiCOM'09 Proceedings of the 5th International Conference on Wireless communications, networking and mobile computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We focus the attention on the audio scene segmentation in AAC domain for audio-based multimedia indexing and retrieval applications. In particular, a MFCC extraction method is proposed, which is adaptive to the window switch in AAC encoding process, and independent of the audio sampling frequency. We discuss the fusion method of MFCC features, which came from different window type in order to keep the balance of the frequency and temporal resolution. A series of experiments via the probability distribution of MFCC were implemented to test the effective in audio scene segmentation. The experimental results show that such approach based on compression domain can approach the performance of the system based on PCM audio, and the CPU overload decreased dramatically. It is meaningful to the real time analysis of audio content.