This study proposes new group delay estimation techniques for analyzing the resonance patterns of short-term discrete-time signals, and of speech signals in particular. Phase processing, or equivalently group delay processing, of speech signals is known to be difficult because large spikes in the phase/group delay functions mask the formant structure. In this study, we first analyze in detail the zero patterns of the z-transform of short-term speech signals and discuss the source of the spikes in group delay functions, namely zeros located close to the unit circle. We show that windowing strongly influences these patterns and therefore short-term phase processing. Through a systematic study, we then show that reliable phase/group delay estimation for speech signals can be achieved by appropriate windowing, and that group delay functions can reveal formant information as well as some characteristics of the glottal flow component of speech signals. However, such phase estimation is highly sensitive to noise, and robust extraction of group delay based parameters remains difficult in real acoustic conditions even with appropriate windowing. As an alternative, we propose processing chirp group delay functions, i.e. group delay functions computed on a circle other than the unit circle in the z-plane, which can be guaranteed to be spike-free. Finally, we present an application to feature extraction for automatic speech recognition (ASR) and show that chirp group delay representations are potentially useful for improving ASR performance.
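To make the idea of a group delay function evaluated off the unit circle concrete, the following is a minimal sketch and not the authors' implementation. It assumes that the z-transform of a finite frame x[n] on a circle of radius rho can be obtained as the DFT of x[n] * rho^(-n), and it uses the standard identity that the group delay equals Re{ DFT(n x[n]) / DFT(x[n]) }. The function name `chirp_group_delay`, the radius 1.1, and the FFT size are illustrative assumptions, not values taken from the study.

```python
import numpy as np

def chirp_group_delay(x, rho=1.0, n_fft=512):
    """Group delay of a finite frame evaluated on the circle |z| = rho.

    rho = 1.0 gives the ordinary group delay on the unit circle.
    (Hypothetical helper for illustration; names and defaults are assumptions.)
    """
    x = np.asarray(x, dtype=float)
    n = np.arange(len(x), dtype=float)
    # X(rho * e^{jw}) = sum_n x[n] * rho^{-n} * e^{-jwn}, so weight the frame first.
    xw = x * rho ** (-n)
    X = np.fft.rfft(xw, n_fft)       # spectrum on the chosen circle
    Y = np.fft.rfft(n * xw, n_fft)   # spectrum of the n-weighted frame
    eps = 1e-12                      # guards against division by exactly zero
    # -d(phase)/dw = Re{Y / X} = (Xr*Yr + Xi*Yi) / |X|^2
    return (X.real * Y.real + X.imag * Y.imag) / (np.abs(X) ** 2 + eps)

# A single zero at z = 0.999, very close to the unit circle, produces a large
# spike in the ordinary group delay; on a circle of radius 1.1 it is much smaller.
frame = np.array([1.0, -0.999])
gd_unit = chirp_group_delay(frame, rho=1.0)
gd_chirp = chirp_group_delay(frame, rho=1.1)
print(np.max(np.abs(gd_unit)), np.max(np.abs(gd_chirp)))
```

In this toy case the evaluation circle is moved away from the zero, so the spike shrinks. How the radius should be chosen for real speech frames, and under what conditions the result is guaranteed to be spike-free, is the subject of the study and is not settled by this sketch.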