Combining Visual and Acoustic Features for Music Genre Classification

  • Authors:
  • Ming-Ju Wu;Zhi-Sheng Chen;Jyh-Shing Roger Jang;Jia-Min Ren;Yi-Hsung Li;Chun-Hung Lu

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • ICMLA '11 Proceedings of the 2011 10th International Conference on Machine Learning and Applications and Workshops - Volume 02
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Music genre classification is a challenging task in the field of music information retrieval. Existing approaches usually attempt to extract features only from acoustic aspect. However, spectrogram also provides useful information because it describes the temporal change of energy distribution over frequency bins. In this paper, we propose the use of Gabor filters to generate effective visual features that can capture the characteristics of a spectrogram隆娄s texture patterns. On the other hand, acoustic features are extracted using universal background model and maximum a posteriori adaptation. Based on these two types of features, we then employ SVM to perform the final classification task. Experimental results demonstrate that combining visual and acoustic features can achieve satisfactory classification accuracy on two widely used datasets.