Prosody analysis of Thai emotion utterances

Authors:
Sukanya Yimngam;Wichian Premchaisawadi;Worapoj Kreesuradej
Affiliations:
Technology King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand;Graduate School of Information Technology, Siam University, Bangkok, Thailand;Technology King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
Venue:
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Year:
2011

Abstract

Emotional speech synthesis is the most important process for generating natural-sounding utterances in a text-to-speech system. Interjection utterances in the Thai language are used to express a number of emotions. This paper presents a study of the prosodic parameters of interjection utterances clipped from Thai speech in movies. The Thai emotional utterances, drawn from various movies, were analyzed and classified into 8 emotion types: neutral, anger, happiness, sadness, fear, pleasant, unpleasant and surprise. The classification of prosodic features is based on fundamental frequency (F0), intensity and duration. The paper also compares the prosodic features of Thai with those of other languages, including English, Italian, French, Spanish and Arabic. The comparison shows significant differences in the prosodic features of each emotion across languages. Therefore, the quality of a text-to-speech system depends on a prosodic analysis specific to each language.