Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise

Authors:
Tuomo Raitio;Antti Suni;Martti Vainio;Paavo Alku
Venue:
Computer Speech and Language
Year:
2014

Abstract

This paper studies the synthesis of speech over a wide vocal effort continuum and its perception in the presence of noise. Three types of speech are recorded and studied along the continuum: breathy, normal, and Lombard speech. Corresponding synthetic voices are created by training and adapting the statistical parametric speech synthesis system GlottHMM. Natural and synthetic speech along the continuum is assessed in listening tests that evaluate the intelligibility, quality, and suitability of speech in three different realistic multichannel noise conditions: silence, moderate street noise, and extreme street noise. The evaluation results show that the synthesized voices with varying vocal effort are rated similarly to their natural counterparts in terms of both intelligibility and suitability.