Speech input and output assessment: multilingual methods and standards
Speech input and output assessment: multilingual methods and standards
Rules for Automatic Grapheme-to-Allophone Transcription in Slovene
TDS '00 Proceedings of the Third International Workshop on Text, Speech and Dialogue
Hi-index | 0.00 |
Preparation, recording, segmentation and pitch labelling of Slovenian diphone inventories are described. A special user friendly interface package was developed in order to facilitate these operations. As acquisition of a labelled diphone inventory or adaptation of a speech synthesis system to synthesise further voices is manually intensive, an automatic procedure is required. A speech recogniser, based on Hidden Markov Models in forced segmentation mode is used to outline phone boundaries within spoken logatoms. A statistical evaluation of manual and automatic segmentation discrepancies is performed so as to estimate the reliability of automatically derived labels. Finally, diphone boundaries are determined and pitch markers are assigned to voiced sections of the speech signal.