Cepstral modulation ratio regression (CMRARE) parameters for audio signal analysis and classification

  • Authors:
  • Rainer Martin;Anil Nagathil

  • Affiliations:
  • Institute of Communication Acoustics, Ruhr-Universität Bochum, Germany;Institute of Communication Acoustics, Ruhr-Universität Bochum, Germany

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose a new set of parameters for audio signal analysis and classification. These parameters are regressions computed on the normalized modulation spectrum of high-resolution cepstral coefficients. The parameter set is scalable in its size and gives a compact representation of the modulation content of speech and other audio signals. These parameters as well as the regression approximation error are well suited for characterizing audio signals in a unified framework. In particular we use a set of eight parameters in a speech/music/noise classification task in which we achieve a classification accuracy which compares very well with other approaches including static and dynamic MFCCs.