Making chroma features more robust to timbre changes

  • Authors:
  • Meinard Muller;Sebastian Ewert;Sebastian Kreuzer

  • Affiliations:
  • Saarland University and MPI Informatik, Campus E1 4, D-66123 Saarbrücken, Germany;Universität Bonn, Informatik III, Römerstr. 164, D-53117, Germany;Universität Bonn, Informatik III, Römerstr. 164, D-53117, Germany

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Chroma-based audio features are a well-established tool for analyzing and comparing music data. By identifying spectral components that differ by a musical octave, chroma features show a high degree of invariance to variations in timbre. In this paper, we describe a novel procedure for making chroma features even more robust to changes in timbre and instrumentation while keeping their discriminative power. Our idea is based on the generally accepted observation that the lower mel-frequency cepstral coefficients (MFCCs) are closely related to timbre. Now, instead of keeping the lower coefficients, we will discard them and only keep the upper coefficients. Furthermore, using a pitch scale instead of a mel scale allows us to project the remaining coefficients onto the twelve chroma bins. Our systematic experiments show that the resulting chroma features have indeed gained a significant boost towards timbre invariance.