Experimental evaluation of tree-based algorithms for intonational breaks representation

  • Authors:
  • Panagiotis Zervas;Gerasimos Xydas;Nikolaos Fakotakis;George Kokkinakis;Georgios Kouroupetroglou

  • Affiliations:
  • Electrical and Computer Engineering Dept, University of Patras, Greece;Department of Informatics and Telecommunications, University of Athens, Greece;Electrical and Computer Engineering Dept, University of Patras, Greece;Electrical and Computer Engineering Dept, University of Patras, Greece;Department of Informatics and Telecommunications, University of Athens, Greece

  • Venue:
  • TSD'05 Proceedings of the 8th international conference on Text, Speech and Dialogue
  • Year:
  • 2005

Abstract

The prosodic specification of an utterance to be spoken by a Text-to-Speech synthesis system can be described in terms of break indices, pitch accents and boundary tones. In particular, the identification of break indices determines the intonational phrase breaks, which affect all subsequent prosody-related procedures. In this paper we use tree-structured predictors for break placement from shallow textual features: CART, which is commonly used in similar tasks, and C4.5, which is introduced here. We used two 500-utterance prosodic corpora, provided by two Greek universities, to compare the two machine learning approaches and to assess their robustness for Greek break modeling. Evaluation of the resulting models showed that both approaches compare favorably with similar work published for other languages, and that the accuracy of C4.5 was between 1% and 2.7% higher than that of CART.
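
The two tree learners named in the abstract differ, among other things, in how they score a candidate split: CART is usually described with Gini impurity, and C4.5 with the gain ratio. The sketch below is only an illustration of those two criteria and is not the authors' implementation. The toy rows and the feature names `punct` and `pos` are hypothetical and do not come from the corpora used in the paper. Standard CART considers binary splits; for simplicity this sketch scores a multiway partition with both criteria.

```python
from collections import Counter
from math import log2


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def partition(rows, feature):
    """Group the labels of `rows` by the value of `feature`."""
    groups = {}
    for features, label in rows:
        groups.setdefault(features[feature], []).append(label)
    return list(groups.values())


def gini_decrease(rows, feature):
    """Impurity decrease in the style of CART (multiway simplification)."""
    labels = [label for _, label in rows]
    n = len(labels)
    groups = partition(rows, feature)
    return gini(labels) - sum(len(g) / n * gini(g) for g in groups)


def gain_ratio(rows, feature):
    """Information gain divided by split information, as in C4.5."""
    labels = [label for _, label in rows]
    n = len(labels)
    groups = partition(rows, feature)
    gain = entropy(labels) - sum(len(g) / n * entropy(g) for g in groups)
    split_info = -sum(len(g) / n * log2(len(g) / n) for g in groups)
    return gain / split_info if split_info > 0 else 0.0


# Hypothetical toy data: one row per word juncture, with shallow textual
# features and a break / no_break label. Invented for illustration only.
rows = [
    ({"punct": True, "pos": "NOUN"}, "break"),
    ({"punct": True, "pos": "VERB"}, "break"),
    ({"punct": False, "pos": "DET"}, "no_break"),
    ({"punct": False, "pos": "NOUN"}, "no_break"),
    ({"punct": False, "pos": "VERB"}, "no_break"),
    ({"punct": False, "pos": "NOUN"}, "break"),
]

for feature in ("punct", "pos"):
    print(feature, gini_decrease(rows, feature), gain_ratio(rows, feature))
```

On this toy data both criteria prefer the `punct` feature. On real data the two criteria can rank features differently, in particular because the gain ratio penalizes splits with many branches.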