Multilingual text analysis for text-to-speech synthesis

Authors:
Richard Sproat
Affiliations:
Language Modeling Research Department, Bell Laboratories, Lucent Technologies, 700 Mountain Avenue, Room 2d-451, Murray Hill, NJ, 07974–0636, USA. e-mail: rws@bell-labs.com
Venue:
Natural Language Engineering
Year:
1996

Citing 9
Cited 4

Pitch accent in context: predicting intonational prominence from text

Artificial Intelligence - Special volume on natural language processing
Regular models of phonological rule systems

Computational Linguistics - Special issue on computational phonology
A stochastic finite-state word-segmentation algorithm for Chinese

Computational Linguistics
Regular expressions for language engineering

Natural Language Engineering
Compilation of weighted finite-state transducers from decision trees

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
An efficient compiler for weighted rewrite rules

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
A finite-state morphological processor for Spanish

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
Two-level morphology with composition

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 1
Weighted rational transductions and their application to human language processing

HLT '94 Proceedings of the workshop on Human Language Technology

Integrating geometrical and linguistic analysis for email signature block parsing

ACM Transactions on Information Systems (TOIS)
E-Mail Signature Block Analysis

ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
Efficient Parsing of Romanian Language for Text-to-Speech Purposes

TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
Enhanced word decomposition by calibrating the decision threshold of probabilistic models and using a model ensemble

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite state transducers, which serves as the text analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, German, Russian, Mandarin and Japanese.