Robust speech detection based on phoneme recognition features

Authors:
France Mihelič;Janez Žibert
Affiliations:
Faculty of Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia;Faculty of Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia
Venue:
TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Year:
2006

Citing 2
Cited 1

An overview of Broadcast News corpora

Speech Communication - Special issue on automatic transcription of broadcast news data
Speech/music segmentation using entropy and dynamism features in a HMM classification framework

Speech Communication

Gabor-Based Kernel Partial-Least-Squares Discrimination Features for Face Recognition

Informatica

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce new method for discriminating speech and non-speech segments in audio signals based on the transcriptions produced by phoneme recognizers Four measures based on consonant-vowels and voiced-unvoiced pairs obtained from different phonemes speech recognizers were proposed They were constructed in a way to be recognizer and language independent and could be applied in different segmentation-classification frameworks The segmentation systems were evaluated on different broadcast news datasets consisted of more than 60 hours of multilingual BN shows The results of these evaluations illustrate the robustness of the proposed features in comparison to MFCC and posterior probability based features The overall frame accuracies of the proposed approaches varied in range from 95% to 98% and remained stable through different test conditions and different phoneme recognizers.