Human evaluation of the LOGOS' spoken dialogue system
Proceedings of the 1st international conference on PErvasive Technologies Related to Assistive Environments
Improving phone duration modelling using support vector regression fusion
Speech Communication
Feature selection for improved phone duration modeling of greek emotional speech
SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Duration modeling of phonemes for Amharic text to speech system
Proceedings of the International Conference on Management of Emergent Digital EcoSystems
Hi-index | 0.00 |
In this paper we cope with the task of modeling phoneme duration for Greek speech synthesis. In particular we apply well established machine learning approaches to the WCL-1 prosodic database for predicting segmental durations from shallow morphosyntactic and prosodic features. We employ decision trees, instance based learning and linear regression. Trained on a 5500 word database, both CART and linear regression models proved to be the most effective in terms for the task with a root mean square error of 0.0252 and 0.0251 respectively.