The SIGMA algorithm: a glottal activity detector for electroglottographic signals
IEEE Transactions on Audio, Speech, and Language Processing
Three dimensions of pitched instrument onset detection
IEEE Transactions on Audio, Speech, and Language Processing
Auditory spectrum-based pitched instrument onset detection
IEEE Transactions on Audio, Speech, and Language Processing
On the detection of pitch marks using a robust multi-phase algorithm
Speech Communication
Spoken emotion recognition using glottal symmetry
EURASIP Journal on Advances in Signal Processing - Special issue on emotion and mental state recognition from speech
Hi-index | 0.00 |
Measures based on the group delay of the LPC residual have been used by a number of authors to identify the time instants of glottal closure in voiced speech. In this paper, we discuss the theoretical properties of three such measures and we also present a new measure having useful properties. We give a quantitative assessment of each measure's ability to detect glottal closure instants evaluated using a speech database that includes a direct measurement of glottal activity from a Laryngograph/EGG signal. We find that when using a fixed-length analysis window, the best measures can detect the instant of glottal closure in 97% of larynx cycles with a standard deviation of 0.6 ms and that in 9% of these cycles an additional excitation instant is found that normally corresponds to glottal opening. We show that some improvement in detection rate may be obtained if the analysis window length is adapted to the speech pitch. If the measures are applied to the preemphasized speech instead of to the LPC residual, we find that the timing accuracy worsens but the detection rate improves slightly. We assess the computational cost of evaluating the measures and we present new recursive algorithms that give a substantial reduction in computation in all cases.