Chirp group delay analysis of speech signals

Authors:
Baris Bozkurt;Laurent Couvreur;Thierry Dutoit
Affiliations:
TCTS Lab., Faculté Polytechnique De Mons, Initialis Scientific Parc, B-7000 Mons, Belgium;TCTS Lab., Faculté Polytechnique De Mons, Initialis Scientific Parc, B-7000 Mons, Belgium;TCTS Lab., Faculté Polytechnique De Mons, Initialis Scientific Parc, B-7000 Mons, Belgium
Venue:
Speech Communication
Year:
2007

Citing 12
Cited 4

Speech processing using group delay functions

Signal Processing
Formant extraction from group delay function

Speech Communication
Speech recognition in noisy environments: a survey

Speech Communication
Polynomial roots from companion matrix eigenvalues

Mathematics of Computation
Effects of phase on the perception of intervocalic stop consonants

Speech Communication
Discrete-time signal processing (2nd ed.)

Discrete-time signal processing (2nd ed.)
Robust speech recognition in embedded system and PC applications

Robust speech recognition in embedded system and PC applications
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Connectionist Speech Recognition: A Hybrid Approach

Connectionist Speech Recognition: A Hybrid Approach
Phase Modelling of Speech Excitation for Low Bit-Rate Sinusoidal Transform Coding

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 3 - Volume 3
On phase perception in speech

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
A study on perceptual distance measure for phase spectrum of stimuli

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 05

Three dimensions of pitched instrument onset detection

IEEE Transactions on Audio, Speech, and Language Processing
Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation

Speech Communication
The hartley phase spectrum as an assistive feature for classification

NOLISP'09 Proceedings of the 2009 international conference on Advances in Nonlinear Speech Processing
Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

This study proposes new group delay estimation techniques that can be used for analyzing resonance patterns of short-term discrete-time signals and more specifically speech signals. Phase processing or equivalently group delay processing of speech signals are known to be difficult due to large spikes in the phase/group delay functions that mask the formant structure. In this study, we first analyze in detail the z-transform zero patterns of short-term speech signals in the z-plane and discuss the sources of spikes on group delay functions, namely the zeros closely located to the unit circle. We show that windowing largely influences these patterns, therefore short-term phase processing. Through a systematic study, we then show that reliable phase/group delay estimation for speech signals can be achieved by appropriate windowing and group delay functions can reveal formant information as well as some of the characteristics of the glottal flow component in speech signals. However, such phase estimation is highly sensitive to noise and robust extraction of group delay based parameters remains difficult in real acoustic conditions even with appropriate windowing. As an alternative, we propose processing of chirp group delay functions, i.e. group delay functions computed on a circle other than the unit circle in z-plane, which can be guaranteed to be spike-free. We finally present one application in feature extraction for automatic speech recognition (ASR). We show that chirp group delay representations are potentially useful for improving ASR performance.