Learning to say it well: reranking realizations by predicted synthesis quality

Authors:
Crystal Nakatsu;Michael White
Affiliations:
The Ohio State University, Columbus, OH;The Ohio State University, Columbus, OH
Venue:
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Year:
2006

Abstract

This paper presents a method for adapting a language generator to the strengths and weaknesses of a synthetic voice, thereby improving the naturalness of synthetic speech in a spoken language dialogue system. The method trains a discriminative reranker to select paraphrases that are predicted to sound natural when synthesized. The ranker is trained on realizer and synthesizer features in a supervised fashion, using human judgements of synthetic voice quality on a sample of the paraphrases representative of the generator's capability. Results from a cross-validation study indicate that discriminative paraphrase reranking can achieve substantial improvements in naturalness on average, ameliorating the problem of highly variable synthesis quality typically encountered with today's unit selection synthesizers.
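The reranking idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the learning algorithm or list the features, so everything specific here is an assumption made for illustration: a pairwise perceptron stands in for the discriminative ranker, each paraphrase is described by two hypothetical features (a realizer language-model score and a synthesizer join cost), and the ratings are invented toy values, not data from the paper.

```python
import random

def score(w, feats):
    """Linear score of one paraphrase under weight vector w."""
    return sum(wi * fi for wi, fi in zip(w, feats))

def train_pairwise_ranker(groups, epochs=20, seed=0):
    """Learn weights from preference pairs (a pairwise perceptron, assumed here).

    groups: list of paraphrase groups; each group is a list of
    (features, human_rating) tuples for paraphrases of the same content.
    Pairs are only formed within a group, never across groups.
    """
    rng = random.Random(seed)
    dim = len(groups[0][0][0])
    w = [0.0] * dim
    pairs = [(a, b)
             for group in groups
             for a, rating_a in group
             for b, rating_b in group
             if rating_a > rating_b]          # a was judged better than b
    for _ in range(epochs):
        rng.shuffle(pairs)
        for better, worse in pairs:
            diff = [x - y for x, y in zip(better, worse)]
            if score(w, diff) <= 0:           # pair is misordered: update
                w = [wi + di for wi, di in zip(w, diff)]
    return w

def rerank(w, candidates):
    """Return candidate feature vectors sorted from best to worst predicted quality."""
    return sorted(candidates, key=lambda f: score(w, f), reverse=True)

# Toy data (invented): features are [lm_score, join_cost]; rating is a
# hypothetical 1-5 naturalness judgement of the synthesized paraphrase.
training_groups = [
    [([0.7, 0.8], 2.0), ([0.9, 0.2], 4.5), ([0.8, 0.5], 3.0)],
    [([0.5, 0.9], 1.5), ([0.6, 0.3], 4.0)],
]
weights = train_pairwise_ranker(training_groups)

# At generation time, pick the paraphrase predicted to sound most natural.
new_candidates = [[0.7, 0.7], [0.75, 0.1]]
best = rerank(weights, new_candidates)[0]
```

Forming pairs only within a group reflects that the ranker chooses among alternative realizations of the same content, so ratings are compared only between paraphrases that could substitute for one another.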