A novel approach to pitch mark determination based on dynamical systems theory is presented. Pitch marks are used in speech analysis and modification, for example in jitter measurement or time-scale modification. The algorithm works in a pseudo-state space and calculates the Poincaré section at a chosen point in that space. Pitch marks are then found where the trajectories cross the Poincaré plane of the initial point. The procedure is performed frame-wise to account for the changing dynamics of the speech production system. The system is intended for real-time use, so higher-level processing extending over more than one frame is not used, and the processing delay is therefore limited to one frame. The algorithm is evaluated by calculating an average pitch value for 10 ms frames on a small database with pitch measurements from a laryngograph signal, and the results are compared to a reference correlation-based pitch mark algorithm. The proposed algorithm performs comparably to the reference algorithm but, in contrast to it, correctly follows the pitch marks of diplophonic voices.
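The idea of finding pitch marks at Poincaré-plane crossings can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how the pseudo-state space is built or how the plane is oriented, so the sketch assumes a time-delay embedding and a plane through the initial point whose normal is the local flow direction. The names `delay_embed` and `poincare_pitch_marks`, the parameters `dim`, `tau`, `start` and `radius`, and the synthetic two-harmonic test frame are all hypothetical choices made for illustration.

```python
import numpy as np

def delay_embed(frame, dim=3, tau=20):
    """Time-delay embedding (assumed construction of the pseudo-state space)."""
    n = len(frame) - (dim - 1) * tau
    return np.stack([frame[k * tau : k * tau + n] for k in range(dim)], axis=1)

def poincare_pitch_marks(frame, dim=3, tau=20, start=0, radius=0.25):
    """Return sample indices where the trajectory re-crosses the plane at traj[start]."""
    traj = delay_embed(np.asarray(frame, dtype=float), dim, tau)
    x0 = traj[start]
    # Assumption: the plane normal is the local flow direction at the initial point.
    # A silent (constant) frame would give a zero vector here; not handled in this sketch.
    normal = traj[start + 1] - x0
    normal = normal / np.linalg.norm(normal)
    # Signed distance of every trajectory point to the plane.
    side = (traj - x0) @ normal
    # Crossings in the same direction as at the initial point (negative to non-negative).
    up = np.flatnonzero((side[:-1] < 0) & (side[1:] >= 0)) + 1
    up = up[up > start]
    # Keep only crossings that return close to the initial point.
    dist = np.linalg.norm(traj - x0, axis=1)
    keep = dist[up] < radius * dist.max()
    return np.concatenate(([start], up[keep]))

# Synthetic frame: 50 ms at 8 kHz, 100 Hz fundamental plus a weaker second harmonic.
fs, f0 = 8000, 100.0
t = np.arange(400) / fs
frame = np.sin(2 * np.pi * f0 * t) + 0.3 * np.sin(4 * np.pi * f0 * t)

marks = poincare_pitch_marks(frame)
# Average pitch for the frame from the mean spacing between marks.
f0_est = fs / np.mean(np.diff(marks))
print(marks, f0_est)
```

Only the current frame is used, in keeping with the one-frame delay described above. On real speech the delay `tau`, the embedding dimension and the neighbourhood `radius` would need tuning, and the choice of initial point affects where within the period the marks fall.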