Segmentation and labelling of Slovenian diphone inventories

  • Authors:
  • Jerneja Gros;Ivo Ipšić;Simon Dobrišek;France Mihelič;Nikola Pavešić

  • Affiliations:
  • University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia

  • Venue:
  • COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

Preparation, recording, segmentation and pitch labelling of Slovenian diphone inventories are described. A special user friendly interface package was developed in order to facilitate these operations. As acquisition of a labelled diphone inventory or adaptation of a speech synthesis system to synthesise further voices is manually intensive, an automatic procedure is required. A speech recogniser, based on Hidden Markov Models in forced segmentation mode is used to outline phone boundaries within spoken logatoms. A statistical evaluation of manual and automatic segmentation discrepancies is performed so as to estimate the reliability of automatically derived labels. Finally, diphone boundaries are determined and pitch markers are assigned to voiced sections of the speech signal.