New speech/music discrimination approach based on fundamental frequency estimation

Authors:
N. Ruiz-Reyes;P. Vera-Candeas;J. E. Muñoz;S. García-Galán;F. J. Cañadas
Affiliations:
Department of Telecommunication Engineering, University of Jaén Polytechnic School, Linares, Spain;Department of Telecommunication Engineering, University of Jaén Polytechnic School, Linares, Spain;Department of Telecommunication Engineering, University of Jaén Polytechnic School, Linares, Spain;Department of Telecommunication Engineering, University of Jaén Polytechnic School, Linares, Spain;Department of Telecommunication Engineering, University of Jaén Polytechnic School, Linares, Spain
Venue:
Multimedia Tools and Applications
Year:
2009

Citing 15
Cited 1

Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds

Speech Communication
Video Handling with Music and Speech Detection

IEEE MultiMedia
SIA: A Supervised Inductive Algorithm with Genetic Search for Learning Attributes based Concepts

ECML '93 Proceedings of the European Conference on Machine Learning
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
A learning system based on genetic adaptive algorithms

A learning system based on genetic adaptive algorithms
Intelligent behavior as an adaptation to the task environment

Intelligent behavior as an adaptation to the task environment
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
Real-time discrimination of broadcast speech/music

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Prosodic word boundary detection using statistical modeling of moraic fundamental frequency contours and its use for continuous speech recognition

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
A comparison of features for speech, music discrimination

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Speech/music discrimination for multimedia applications

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 04
Adaptive network-based fuzzy inference system vs. other classification algorithms for warped LPC-based speech/music discrimination

Engineering Applications of Artificial Intelligence
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition

IEEE Transactions on Audio, Speech, and Language Processing
Discriminating Between Pitched Sources in Music Audio

IEEE Transactions on Audio, Speech, and Language Processing
A speech/music discriminator based on RMS and zero-crossings

IEEE Transactions on Multimedia

Low-complexity F0-based speech/nonspeech discrimination approach for digital hearing aids

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic discrimination of speech and music is an important tool in many multimedia applications. The paper presents a robust and effective approach for speech/music discrimination, which relies on a set of features derived from fundamental frequency (F0) estimation. Comparison between the proposed set of features and some commonly used timbral features is performed, aiming to assess the good discriminatory power of the proposed F0-based feature set. The classification scheme is composed of a classical Statistical Pattern Recognition classifier followed by a Fuzzy Rules Based System. Comparison with other well-proven classification schemes is also performed. Experimental results reveal that our speech/music discriminator is robust enough, making it suitable for a wide variety of multimedia applications.