A two-level drive – response model of non-stationary speech signals

Authors:
Friedhelm R. Drepper
Affiliations:
Zentralinstitut für Elektronik, Forschungszentrum Jülich GmbH, Jülich, Germany
Venue:
NOLISP'05 Proceedings of the 3rd international conference on Non-Linear Analyses and Algorithms for Speech Processing
Year:
2005

Citing 2
Cited 2

Nonlinear time series analysis

Nonlinear time series analysis
Dynamic time warping comb filter for the enhancement of speech degraded by white Gaussian noise

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II

Voiced speech as response of a self-consistent fundamental drive

Speech Communication
Non-stationary self-consistent acoustic objects as atoms of voiced speech

NOLISP'07 Proceedings of the 2007 international conference on Advances in nonlinear speech processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The transmission protocol of voiced speech is hypothesized to be based on a funda mental drive process, which synchronizes the vocal tract excitation on the trans mitter side and evokes the pitch perception on the receiver side. A band limited fundamental drive is extrac ted from a voice specific subband decom position of the speech signal. When the near periodic drive is used as fun damental drive of a two-level drive-response model, a more or less aperiodic voiced excitation can be recon struc ted as a more or less aperiodic trajectory on a low dimensional continuous syn chro nization manifold (surface) described by speaker and phoneme specific coupling functions. In the case of vowels and nasals the excitation can be described by a univariate coupling function, which depends on the momentary phase of the funda mental drive. In the case of other voiced consonants the coupling function may as well depend on a delayed funda mental phase with a phoneme speci fic time delay. The delay may exceed the length of the analysis window. The resulting long range correlation cannot be analysed or synthesized by models assuming stationary excitation.