Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Working Set Selection Using Second Order Information for Training Support Vector Machines
The Journal of Machine Learning Research
A comparison of features for speech, music discrimination
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Evaluation of Objective Quality Measures for Speech Enhancement
IEEE Transactions on Audio, Speech, and Language Processing
A speech/music discriminator based on RMS and zero-crossings
IEEE Transactions on Multimedia
A Speech/Music Discriminator of Radio Recordings Based on Dynamic Programming and Bayesian Networks
IEEE Transactions on Multimedia
Hi-index | 0.00 |
While many efforts have been made in the audio signal classification field, the noise interruption problem is seldom concerned so far, especially in many telecommunication applications, where a real-time and noise robust approach is needed. This paper addresses this problem by proposing two novel robust features: Average Pitch Density (APD) and Relative Tonal Power Density (RTPD). APD refers to the differences in tone characteristics of music and speech signals, and RTPD especially focuses on the distinct properties of the percussion instruments. The comparison experiments are implemented on two databases. The first one is reorganized from the corpus collected by Scheirer et al [3]. The second one consists of data collected from various recording situations. The novel features are compared with several state-of-the-art features and are found to achieve significant robustness.