Music genre classification using LBP textural features

  • Authors:
  • Y. M. G. Costa;L. S. Oliveira;A. L. Koerich;F. Gouyon;J. G. Martins

  • Affiliations:
  • State University of Maringá (UEM), Av. Colombo, 5790 - Bloco C56, Maringá, PR 87020-900, Brazil and Federal University of Paraná (UFPR), Rua Cel. Francisco H. dos Santos, 100, Curit ...;Federal University of Paraná (UFPR), Rua Cel. Francisco H. dos Santos, 100, Curitiba, PR 81531-990, Brazil;Federal University of Paraná (UFPR), Rua Cel. Francisco H. dos Santos, 100, Curitiba, PR 81531-990, Brazil and Pontifical Catholic University of Paraná (PUCPR), R. Imaculada Conceiç ...;Institute for Systems and Computer Engineering of Porto (INESC), R. Dr. Roberto Frias, 378, Porto 4200-465, Portugal;Federal University of Paraná (UFPR), Rua Cel. Francisco H. dos Santos, 100, Curitiba, PR 81531-990, Brazil and Federal Technological University of Paraná (UTFPR), R. Cristo Rei, 19, Tole ...

  • Venue:
  • Signal Processing
  • Year:
  • 2012

Quantified Score

Hi-index 0.08

Visualization

Abstract

In this paper we present an approach to music genre classification which converts an audio signal into spectrograms and extracts texture features from these time-frequency images which are then used for modeling music genres in a classification system. The texture features are based on Local Binary Pattern, a structural texture operator that has been successful in recent image classification research. Experiments are performed with two well-known datasets: the Latin Music Database (LMD), and the ISMIR 2004 dataset. The proposed approach takes into account some different zoning mechanisms to perform local feature extraction. Results obtained with and without local feature extraction are compared. We compare the performance of texture features with that of commonly used audio content based features (i.e. from the MARSYAS framework), and show that texture features always outperforms the audio content based features. We also compare our results with results from the literature. On the LMD, the performance of our approach reaches about 82.33%, above the best result obtained in the MIREX 2010 competition on that dataset. On the ISMIR 2004 database, the best result obtained is about 80.65%, i.e. below the best result on that dataset found in the literature.