Pitch accent in context: predicting intonational prominence from text
Artificial Intelligence - Special volume on natural language processing
Regular models of phonological rule systems
Computational Linguistics - Special issue on computational phonology
A stochastic finite-state word-segmentation algorithm for Chinese
Computational Linguistics
Regular expressions for language engineering
Natural Language Engineering
Compilation of weighted finite-state transducers from decision trees
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
An efficient compiler for weighted rewrite rules
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
A finite-state morphological processor for Spanish
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
Two-level morphology with composition
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 1
Weighted rational transductions and their application to human language processing
HLT '94 Proceedings of the workshop on Human Language Technology
Integrating geometrical and linguistic analysis for email signature block parsing
ACM Transactions on Information Systems (TOIS)
E-Mail Signature Block Analysis
ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
Efficient Parsing of Romanian Language for Text-to-Speech Purposes
TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Hi-index | 0.00 |
We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite state transducers, which serves as the text analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, German, Russian, Mandarin and Japanese.