Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features

Authors:
Chang-Hsing Lee;Jau-Ling Shih;Kun-Ming Yu;Hwai-San Lin
Affiliations:
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan;Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan;Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan;Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan
Venue:
IEEE Transactions on Multimedia
Year:
2009

Citing 15
Cited 4

Fundamentals of speech recognition

Fundamentals of speech recognition
On Combining Classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence
Robust speech recognition using the modulation spectrogram

Speech Communication - Special issue on robust speech recognition
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
A Wavelet Packet representation of audio signals for music genre classification using different ensemble and feature selection techniques

MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval

MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval
Aggregate features and ADABOOST for music classification

Machine Learning
Joint acoustic and modulation frequency

EURASIP Journal on Applied Signal Processing
Modulation-scale analysis for content identification

IEEE Transactions on Signal Processing - Part II
Modeling timbre distance with temporal statistics from polyphonic music

IEEE Transactions on Audio, Speech, and Language Processing
Temporal Feature Integration for Music Genre Classification

IEEE Transactions on Audio, Speech, and Language Processing
Multigroup classification of audio signals using time-frequency parameters

IEEE Transactions on Multimedia
Toward intelligent music information retrieval

IEEE Transactions on Multimedia
Content-Based Information Fusion for Semi-Supervised Music Genre Classification

IEEE Transactions on Multimedia
Audio classification based on MPEG-7 spectral basis representations

IEEE Transactions on Circuits and Systems for Video Technology

On feature combination for music classification

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Unsupervised music genre classification with a model-based approach

EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
An analysis of the GTZAN music genre dataset

Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies
An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we will propose an automatic music genre classification approach based on long-term modulation spectral analysis of spectral (OSC and MPEG-7 NASE) as well as cepstral (MFCC) features. Modulation spectral analysis of every feature value will generate a corresponding modulation spectrum and all the modulation spectra can be collected to form a modulation spectrogram which exhibits the time-varying or rhythmic information of music signals. Each modulation spectrum is then decomposed into several logarithmically-spaced modulation sub-bands. The modulation spectral contrast (MSC) and modulation spectral valley (MSV) are then computed from each modulation subband. Effective and compact features are generated from statistical aggregations of the MSCs and MSVs of all modulation subbands. An information fusion approach which integrates both feature level fusion method and decision level combination method is employed to improve the classification accuracy. Experiments conducted on two different music datasets have shown that our proposed approach can achieve higher classification accuracy than other approaches with the same experimental setup.