Unit selection in a concatenative speech synthesis system using a large speech database
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Perceptual and objective detection of discontinuities in concatenative speech synthesis
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Springer Handbook of Speech Processing
Springer Handbook of Speech Processing
A dynamic cost weighting framework for unit selection text-to-speech synthesis
IEEE Transactions on Audio, Speech, and Language Processing
Development of syllable-based text to speech synthesis system in Bengali
International Journal of Speech Technology
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
IEEE Transactions on Information Theory
A Hybrid Text-to-Speech System That Combines Concatenative and Statistical Synthesis Units
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
This paper presents the design and development of syllable specific unit selection cost functions for improving the quality of text-to-speech synthesis. Appropriate unit selection cost functions, namely concatenation cost and target cost, are proposed for syllable based synthesis. Concatenation costs are defined based on the type of segments present at the syllable joins. Proposed concatenation costs have shown significant reduction in perceptual discontinuity at syllable joins. Three-stage target cost formulation is proposed for selecting appropriate units from database. Subjective evaluation has shown improvement in the quality of speech at each stage.