Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter

Authors:
Prasanta Kumar Ghosh;Shrikanth S. Narayanan
Affiliations:
Signal Analysis and Interpretation Laboratory, Department of Electrical Engineering, University of Southern California, Los Angeles, CA 90089, USA;Signal Analysis and Interpretation Laboratory, Department of Electrical Engineering, University of Southern California, Los Angeles, CA 90089, USA
Venue:
Speech Communication
Year:
2011

Citing 6
Cited 0

Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering

Speech Communication - Eurospeech '91
Linear Prediction of Speech

Linear Prediction of Speech
Discrete-time speech signal processing: principles and practice

Discrete-time speech signal processing: principles and practice
Theory and Applications of Digital Speech Processing

Theory and Applications of Digital Speech Processing
Glottal source estimation using a sum-of-exponentials model

IEEE Transactions on Signal Processing
Robust glottal source estimation based on joint source-filter model optimization

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.01

Visualization

Abstract

We propose a glottal source estimation method robust to shimmer and jitter in the glottal flow. The proposed estimation method is based on a joint source-filter optimization technique. The glottal source is modeled by the Liljencrants-Fant (LF) model and the vocal-tract filter is modeled by an auto-regressive filter, which is common in the source-filter approach to speech production. The optimization estimates the parameters of the LF model, the amplitudes of the glottal flow in each pitch period, and the vocal-tract filter coefficients so that the speech production model best describes the observed speech samples. Experiments with synthetic and real speech data show that the proposed estimation method is robust to different phonation types with varying shimmer and jitter characteristics.