Video Handling with Music and Speech Detection
IEEE MultiMedia
SIA: A Supervised Inductive Algorithm with Genetic Search for Learning Attributes based Concepts
ECML '93 Proceedings of the European Conference on Machine Learning
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
A learning system based on genetic adaptive algorithms
A learning system based on genetic adaptive algorithms
Intelligent behavior as an adaptation to the task environment
Intelligent behavior as an adaptation to the task environment
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Real-time discrimination of broadcast speech/music
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
A comparison of features for speech, music discrimination
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Speech/music discrimination for multimedia applications
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 04
Engineering Applications of Artificial Intelligence
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing
Discriminating Between Pitched Sources in Music Audio
IEEE Transactions on Audio, Speech, and Language Processing
A speech/music discriminator based on RMS and zero-crossings
IEEE Transactions on Multimedia
Low-complexity F0-based speech/nonspeech discrimination approach for digital hearing aids
Multimedia Tools and Applications
Hi-index | 0.00 |
Automatic discrimination of speech and music is an important tool in many multimedia applications. The paper presents a robust and effective approach for speech/music discrimination, which relies on a set of features derived from fundamental frequency (F0) estimation. Comparison between the proposed set of features and some commonly used timbral features is performed, aiming to assess the good discriminatory power of the proposed F0-based feature set. The classification scheme is composed of a classical Statistical Pattern Recognition classifier followed by a Fuzzy Rules Based System. Comparison with other well-proven classification schemes is also performed. Experimental results reveal that our speech/music discriminator is robust enough, making it suitable for a wide variety of multimedia applications.