A speech synthesizer for Persian text using a neural network with a smooth ergodic HMM

Authors:
F. Hendessi;A. Ghayoori;T. A. Gulliver
Affiliations:
Isfahan University of Technology, Isfahan, Iran;Isfahan University of Technology, Isfahan, Iran;University of Victoria, Victoria, B.C., Canada
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2005

Abstract

The feasibility of converting text into speech using an inexpensive computer with minimal memory is of great interest. Speech synthesizers have been developed for many popular languages (e.g., English, Chinese, Spanish, and French), but the design of a speech synthesizer depends largely on the structure of the target language. In this article, we develop a Persian synthesizer that includes an innovative text analyzer module. In the synthesizer, the text is segmented into words and, after preprocessing, a neural network is passed over each word. In addition to preprocessing, a new model, the smooth ergodic HMM (SEHMM), is used as a postprocessor to compensate for errors generated by the neural network. The performance of the proposed model is verified, and the intelligibility of the synthetic speech is assessed via listening tests.
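
The text-analysis pipeline described in the abstract (word segmentation, a network passed over each word, then an HMM postprocessor) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names `segment`, `windows`, and `viterbi`, the window width, the toy labels `a` and `b`, and the probabilities are all hypothetical, and the per-letter scores stand in for the outputs of a trained network. The postprocessor shown is a generic ergodic (fully connected) HMM decoded with the Viterbi algorithm, not the SEHMM itself, whose details the abstract does not give.

```python
import math


def segment(text):
    """Split input text into words on whitespace."""
    return text.split()


def windows(word, size=3, pad="_"):
    """Return one fixed-width character window centered on each letter.

    This mimics passing a network over a word: each window would be the
    network input for the letter at its center.
    """
    half = size // 2
    padded = pad * half + word + pad * half
    return [padded[i:i + size] for i in range(len(word))]


def viterbi(emissions, trans, init):
    """Return the most likely label sequence under an ergodic HMM.

    emissions: list of dicts, label -> score for that position
               (stand-ins for per-letter network outputs)
    trans:     dict, (previous, current) -> transition probability;
               ergodic means every pair is present and nonzero
    init:      dict, label -> initial probability
    """
    states = list(init)
    score = {s: math.log(init[s]) + math.log(emissions[0][s]) for s in states}
    back = []
    for em in emissions[1:]:
        new_score, pointers = {}, {}
        for cur in states:
            # Best predecessor for the current label.
            prev = max(states, key=lambda p: score[p] + math.log(trans[(p, cur)]))
            new_score[cur] = (
                score[prev] + math.log(trans[(prev, cur)]) + math.log(em[cur])
            )
            pointers[cur] = prev
        score = new_score
        back.append(pointers)
    # Trace back from the best final label.
    path = [max(states, key=lambda s: score[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


# Toy data (hypothetical): the middle position is a weak, isolated "b".
emissions = [
    {"a": 0.9, "b": 0.1},
    {"a": 0.4, "b": 0.6},
    {"a": 0.9, "b": 0.1},
]
trans = {("a", "a"): 0.8, ("a", "b"): 0.2, ("b", "a"): 0.2, ("b", "b"): 0.8}
init = {"a": 0.5, "b": 0.5}

greedy = [max(em, key=em.get) for em in emissions]  # per-position argmax
smoothed = viterbi(emissions, trans, init)          # sequence-level decoding
```

In this toy case the per-position argmax yields `a, b, a`, while the Viterbi path is `a, a, a`: the transition probabilities outweigh the weak middle score. This is one way a sequence model can compensate for isolated classifier errors, which is the role the abstract assigns to the postprocessor.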