Adaptive network-based fuzzy inference system vs. other classification algorithms for warped LPC-based speech/music discrimination

Authors:
J. E. Muñoz-Expósito;S. García-Galán;N. Ruiz-Reyes;P. Vera-Candeas
Affiliations:
Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain;Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain;Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain;Telecommunication Engineering Department, Polytechnic School, University of Jaén, Linares, Jaén, Spain
Venue:
Engineering Applications of Artificial Intelligence
Year:
2007

Citing 8
Cited 9

Video Handling with Music and Speech Detection

IEEE MultiMedia
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Psychoacoustics: Facts and Models

Psychoacoustics: Facts and Models
Real-time discrimination of broadcast speech/music

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
A comparison of features for speech, music discrimination

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Speech/music discrimination for multimedia applications

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 04
Analysis and design of hierarchical fuzzy systems

IEEE Transactions on Fuzzy Systems
Functional equivalence between radial basis function networks and fuzzy inference systems

IEEE Transactions on Neural Networks

New speech/music discrimination approach based on fundamental frequency estimation

Multimedia Tools and Applications
Intelligent Client-Side Web Caching Scheme Based on Least Recently Used Algorithm and Neuro-Fuzzy System

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Polynomial-based radial basis function neural networks (P-RBF NNs) and their application to pattern classification

Applied Intelligence
Two-stage cascaded classification approach based on genetic fuzzy learning for speech/music discrimination

Engineering Applications of Artificial Intelligence
A fuzzy rule-based meta-scheduler with evolutionary learning for grid computing

Engineering Applications of Artificial Intelligence
Improving expert meta-schedulers for grid computing through weighted rules evolution

WILF'11 Proceedings of the 9th international conference on Fuzzy logic and applications
Neuro-fuzzy system in partitioned client-side Web cache

Expert Systems with Applications: An International Journal
The design of polynomial function-based neural network predictors for detection of software defects

Information Sciences: an International Journal
Type-2 fuzzy decision support system to optimise MANET integration into infrastructure-based wireless systems

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic discrimination of speech and music is an important tool in many multimedia applications. The paper presents an effective approach based on an adaptive network-based fuzzy inference system (ANFIS) for the classification stage required in a speech/music discrimination system. A new simple feature, called warped LPC-based spectral centroid (WLPC-SC), is also proposed. Comparison between WLPC-SC and the classical features proposed in the literature for audio classification is performed, aiming to assess the good discriminatory power of the proposed feature. The vector length used to describe the proposed psychoacoustic-based feature is reduced to a few statistical values (mean, variance and skewness). With the aim of increasing the classification accuracy percentage, the feature space is then transformed to a new feature space by LDA. The classification task is performed applying ANFIS to the features in the transformed space. To evaluate the performance of the ANFIS system for speech/music discrimination, comparison to other commonly used classifiers is reported. The classification results for different types of music and speech signals show the good discriminating power of the proposed approach.