Applying data mining techniques to corpus based prosodic modeling

Authors:
David Escudero-Mancebo;Valentín Cardeñoso-Payo
Affiliations:
Departamento de Informática, Universidad de Valladolid, Campus Miguel Delibes s/n, 47011 Valladolid, Spain;Departamento de Informática, Universidad de Valladolid, Campus Miguel Delibes s/n, 47011 Valladolid, Spain
Venue:
Speech Communication
Year:
2007

Citing 10
Cited 5

From text to speech: the MITalk system

From text to speech: the MITalk system
An introduction to splines for use in computer graphics & geometric modeling

An introduction to splines for use in computer graphics & geometric modeling
The rise/fall/connection model of intonation

Speech Communication
A stochastic model of intonation for text-to-speech synthesis

Speech Communication
Data clustering: a review

ACM Computing Surveys (CSUR)
Developments and paradigms in intonation research

Speech Communication
Prosody modeling with soft templates

Speech Communication
Curve-fitting with piecewise parametric cubics

SIGGRAPH '83 Proceedings of the 10th annual conference on Computer graphics and interactive techniques
Data-driven generation of F0 contours using a superpositional model

Speech Communication
New rule-based and data-driven strategy to incorporate Fujisaki's F/sub 0/ model to a text-to-speech system in Castillian Spanish

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02

Cross-lingual English Spanish tonal accent labeling using decision trees and neural networks

NOLISP'11 Proceedings of the 5th international conference on Advances in nonlinear speech processing
Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence

Speech Communication
Review: Data mining techniques and applications - A decade review from 2000 to 2011

Expert Systems with Applications: An International Journal
A fuzzy classifier to deal with similarity between labels on automatic prosodic labeling

Computer Speech and Language
Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This article presents MEMOInt, a methodology to automatically extract the intonation patterns which characterize a given corpus, with applications in text-to-speech systems. Easy to understand information about the form of the characteristic patterns found in the corpus can be obtained from MEMOint in a way which allows easy comparison with other proposals. A visual representation of the relationship between the set of prosodic features which could have been selected to label the corpus and the intonation contour patterns is also easy to obtain. The particular function-form correspondence associated to the given corpus is represented by means of a list of dictionaries of classes of parameterized F0 patterns, where the access key is given by a sequence of prosodic features. MEMOInt can also be used to obtain valuable information about the relative impact of the use of different parameterization techniques of F0 contours or of different types of intonation units and information about the relevance of different prosodic features. The methodology has been specifically designed to provide a successful strategy to solve the data sparseness problem which usually affects corpora as a consequence of the inherent high variability of the intonation phenomenon.