Segmentation and labelling of Slovenian diphone inventories

Authors:
Jerneja Gros;Ivo Ipšić;Simon Dobrišek;France Mihelič;Nikola Pavešić
Affiliations:
University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia;University of Ljubljana, Ljubljana, Slovenia
Venue:
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Year:
1996

Citing 2
Cited 1

Speech input and output assessment: multilingual methods and standards

Speech input and output assessment: multilingual methods and standards
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

Speech Communication

Rules for Automatic Grapheme-to-Allophone Transcription in Slovene

TDS '00 Proceedings of the Third International Workshop on Text, Speech and Dialogue

Quantified Score

Hi-index	0.00

Visualization

Abstract

Preparation, recording, segmentation and pitch labelling of Slovenian diphone inventories are described. A special user friendly interface package was developed in order to facilitate these operations. As acquisition of a labelled diphone inventory or adaptation of a speech synthesis system to synthesise further voices is manually intensive, an automatic procedure is required. A speech recogniser, based on Hidden Markov Models in forced segmentation mode is used to outline phone boundaries within spoken logatoms. A statistical evaluation of manual and automatic segmentation discrepancies is performed so as to estimate the reliability of automatically derived labels. Finally, diphone boundaries are determined and pitch markers are assigned to voiced sections of the speech signal.