Speech segmentation without speech recognition

  • Authors:
  • Dong Wang;Lie Lu;Hong-Jiang Zhang

  • Affiliations:
  • Dept. of Electron. Eng., Tsinghua Univ., Beijing, China;Inst. for Human-Comput. Commun., Technische Univ. Munchen, Germany;Inst. for Human-Comput. Commun., Technische Univ. Munchen, Germany

  • Venue:
  • ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we presented a semantic speech segmentation approach, in particular sentence segmentation, without speech recognition. In order to get phoneme level information without word recognition information, a novel vowel/consonant/pause (V/C/P) classification is proposed. An adaptive pause detection method is also presented to adapt to various background and environment. Three feature sets, which include pause, rate of speech and prosody, are used to discriminate the sentence boundary. Experiments on broadcasting news indicate that the performance of proposed algorithm is satisfying.