Fundamentals of speech recognition
Fundamentals of speech recognition
Content-based music structure analysis with applications to music semantics understanding
Proceedings of the 12th annual ACM international conference on Multimedia
Music score alignment and computer accompaniment
Communications of the ACM - Music information retrieval
Psychoacoustics: Facts and Models
Psychoacoustics: Facts and Models
Digital Signal Processing (4th Edition)
Digital Signal Processing (4th Edition)
Musical instrument recognition using cepstral coefficients and temporal features
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Towards structural analysis of audio recordings in the presence of musical variations
EURASIP Journal on Applied Signal Processing
Information Retrieval for Music and Motion
Information Retrieval for Music and Motion
Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification
IEEE Transactions on Audio, Speech, and Language Processing
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
IEEE Transactions on Audio, Speech, and Language Processing
Efficient Index-Based Audio Matching
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing
Analysis of the meter of acoustic musical signals
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing
Audio thumbnailing of popular music using chroma-based representations
IEEE Transactions on Multimedia
Multi-objective feature selection in music genre and style recognition tasks
Proceedings of the 13th annual conference on Genetic and evolutionary computation
The need for music information retrieval with user-centered and multimodal strategies
MIRUM '11 Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies
Fast intra-collection audio matching
Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies
Bilingual analysis of song lyrics and audio words
Proceedings of the 20th ACM international conference on Multimedia
On the Relative Importance of Individual Components of Chord Recognition Systems
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Hi-index | 0.00 |
Chroma-based audio features are a well-established tool for analyzing and comparing harmony-based Western music that is based on the equal-tempered scale. By identifying spectral components that differ by a musical octave, chroma features possess a considerable amount of robustness to changes in timbre and instrumentation. In this paper, we describe a novel procedure that further enhances chroma features by significantly boosting the degree of timbre invariance without degrading the features' discriminative power. Our idea is based on the generally accepted observation that the lower mel-frequency cepstral coefficients (MFCCs) are closely related to timbre. Now, instead of keeping the lower coefficients, we discard them and only keep the upper coefficients. Furthermore, using a pitch scale instead of a mel scale allows us to project the remaining coefficients onto the 12 chroma bins. We present a series of experiments to demonstrate that the resulting chroma features outperform various state-of-the art features in the context of music matching and retrieval applications. As a final contribution, we give a detailed analysis of our enhancement procedure revealing the musical meaning of certain pitch-frequency cepstral coefficients.