Robust speech detection based on phoneme recognition features

  • Authors:
  • France Mihelič;Janez Žibert

  • Affiliations:
  • Faculty of Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia;Faculty of Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia

  • Venue:
  • TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce new method for discriminating speech and non-speech segments in audio signals based on the transcriptions produced by phoneme recognizers Four measures based on consonant-vowels and voiced-unvoiced pairs obtained from different phonemes speech recognizers were proposed They were constructed in a way to be recognizer and language independent and could be applied in different segmentation-classification frameworks The segmentation systems were evaluated on different broadcast news datasets consisted of more than 60 hours of multilingual BN shows The results of these evaluations illustrate the robustness of the proposed features in comparison to MFCC and posterior probability based features The overall frame accuracies of the proposed approaches varied in range from 95% to 98% and remained stable through different test conditions and different phoneme recognizers.