Video Handling with Music and Speech Detection
IEEE MultiMedia
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Psychoacoustics: Facts and Models
Psychoacoustics: Facts and Models
Real-time discrimination of broadcast speech/music
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
A comparison of features for speech, music discrimination
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Speech/music discrimination for multimedia applications
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 04
Analysis and design of hierarchical fuzzy systems
IEEE Transactions on Fuzzy Systems
Functional equivalence between radial basis function networks and fuzzy inference systems
IEEE Transactions on Neural Networks
New speech/music discrimination approach based on fundamental frequency estimation
Multimedia Tools and Applications
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Engineering Applications of Artificial Intelligence
A fuzzy rule-based meta-scheduler with evolutionary learning for grid computing
Engineering Applications of Artificial Intelligence
Improving expert meta-schedulers for grid computing through weighted rules evolution
WILF'11 Proceedings of the 9th international conference on Fuzzy logic and applications
Neuro-fuzzy system in partitioned client-side Web cache
Expert Systems with Applications: An International Journal
The design of polynomial function-based neural network predictors for detection of software defects
Information Sciences: an International Journal
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Automatic discrimination of speech and music is an important tool in many multimedia applications. The paper presents an effective approach based on an adaptive network-based fuzzy inference system (ANFIS) for the classification stage required in a speech/music discrimination system. A new simple feature, called warped LPC-based spectral centroid (WLPC-SC), is also proposed. Comparison between WLPC-SC and the classical features proposed in the literature for audio classification is performed, aiming to assess the good discriminatory power of the proposed feature. The vector length used to describe the proposed psychoacoustic-based feature is reduced to a few statistical values (mean, variance and skewness). With the aim of increasing the classification accuracy percentage, the feature space is then transformed to a new feature space by LDA. The classification task is performed applying ANFIS to the features in the transformed space. To evaluate the performance of the ANFIS system for speech/music discrimination, comparison to other commonly used classifiers is reported. The classification results for different types of music and speech signals show the good discriminating power of the proposed approach.