Machine-readable dictionaries in text-to-speech systems

Authors:
Judith L. Klavans;Evelyne Tzoukermann
Affiliations:
Columbia University, New York, New York;A. T.&T. Bell Laboratories, Murray Hill, N. J.
Venue:
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Year:
1994

Citing 5
Cited 0

LDOCE and speech recognition

Computational lexicography for natural language processing
Extracting semantic hierarchies from a large on-line dictionary

ACL '85 Proceedings of the 23rd annual meeting on Association for Computational Linguistics
Dictionaries, dictionary grammars and dictionary entry parsing

ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
The BICORD system: combining lexical information from bilingual corpora and machine readable dictionaries

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
A finite-state morphological processor for Spanish

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents the results of an experiment using machine-readable dictionaries (MRDs) and corpora for building concatenative units for text to speech (TTS) systems. Theoretical questions concerning the nature of phonemic data in dictionaries are raised; phonemic dictionary data is viewed as a representative corpus over which to extract n-gram phonemic frequencies in the language. Dictionary data are compared to corpus data, and phoneme inventories are evaluated for coverage. A methodology is defined to compute phonemic n-grams for incorporation into a TTS system.