Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation

Authors:
Thomas Drugman;Baris Bozkurt;Thierry Dutoit
Affiliations:
TCTS Lab, University of Mons, Belgium;Department of Electrical & Electronics Engineering, Izmir Institute of Technology, Turkey;TCTS Lab, University of Mons, Belgium
Venue:
Speech Communication
Year:
2011

Citing 7
Cited 3

Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering

Speech Communication - Eurospeech '91
Discrete-Time Signal Processing

Discrete-Time Signal Processing
Chirp group delay analysis of speech signals

Speech Communication
A speech spectrum distortion measure with interframe memory

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Phase-based methods for voice source analysis

NOLISP'07 Proceedings of the 2007 international conference on Advances in nonlinear speech processing
Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm

IEEE Transactions on Audio, Speech, and Language Processing
A new method for obtaining accurate estimates of vocal-tract filters and glottal waves from vowel sounds

IEEE Transactions on Audio, Speech, and Language Processing

Complex cepstrum for statistical parametric speech synthesis

Speech Communication
Multi classifier systems and forward backward feature selection algorithms to classify emotional coloured speech

Proceedings of the 15th ACM on International conference on multimodal interaction
Analysis and HMM-based synthesis of hypo and hyperarticulated speech

Computer Speech and Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

Complex cepstrum is known in the literature for linearly separating causal and anticausal components. Relying on advances achieved by the Zeros of the Z-Transform (ZZT) technique, we here investigate the possibility of using complex cepstrum for glottal flow estimation on a large-scale database. Via a systematic study of the windowing effects on the deconvolution quality, we show that the complex cepstrum causal-anticausal decomposition can be effectively used for glottal flow estimation when specific windowing criteria are met. It is also shown that this complex cepstral decomposition gives similar glottal estimates as obtained with the ZZT method. However, as complex cepstrum uses FFT operations instead of requiring the factoring of high-degree polynomials, the method benefits from a much higher speed. Finally in our tests on a large corpus of real expressive speech, we show that the proposed method has the potential to be used for voice quality analysis.