Adaptive network-based fuzzy inference system vs. other classification algorithms for warped LPC-based speech/music discrimination

  • Authors:
  • J. E. Muñoz-Expósito;S. García-Galán;N. Ruiz-Reyes;P. Vera-Candeas

  • Affiliations:
  • Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain;Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain;Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain;Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain

  • Venue:
  • Engineering Applications of Artificial Intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic discrimination of speech and music is an important tool in many multimedia applications. The paper presents an effective approach based on an adaptive network-based fuzzy inference system (ANFIS) for the classification stage required in a speech/music discrimination system. A new simple feature, called warped LPC-based spectral centroid (WLPC-SC), is also proposed. Comparison between WLPC-SC and the classical features proposed in the literature for audio classification is performed, aiming to assess the good discriminatory power of the proposed feature. The vector length used to describe the proposed psychoacoustic-based feature is reduced to a few statistical values (mean, variance and skewness). With the aim of increasing the classification accuracy percentage, the feature space is then transformed to a new feature space by LDA. The classification task is performed applying ANFIS to the features in the transformed space. To evaluate the performance of the ANFIS system for speech/music discrimination, comparison to other commonly used classifiers is reported. The classification results for different types of music and speech signals show the good discriminating power of the proposed approach.