In this work, we model speech samples with a two-sided generalized Gamma distribution and evaluate its efficiency for voice activity detection. Using a computationally inexpensive maximum likelihood approach, we employ the Bayesian Information Criterion for identifying the phoneme boundaries in noisy speech.
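
As a rough illustration of how a BIC test can locate a boundary under this kind of model, the sketch below scans a window of samples for a single change point. It is not the authors' implementation, and several things in it are assumptions: the density is parameterized as f(x) = γ β^η / (2 Γ(η)) · |x|^(ηγ−1) · exp(−β |x|^γ), the shape parameters γ and η are held fixed (the defaults γ = η = 1 reduce to a Laplacian), and only the scale β is estimated per segment, which gives the closed-form maximum likelihood estimate β = Nη / Σ|x|^γ. The function names, the `margin` parameter, and the `penalty_weight` parameter are hypothetical.

```python
import math


def segment_loglik(n, sum_pow, sum_log, gamma, eta):
    """Maximized log-likelihood of n samples under a two-sided generalized
    Gamma density with fixed shapes (gamma, eta) and ML-estimated scale beta.

    sum_pow is the sum of |x|**gamma and sum_log the sum of log|x|
    over the segment.
    """
    beta = n * eta / sum_pow  # closed-form ML estimate for fixed shapes
    return (n * (math.log(gamma) - math.log(2.0) - math.lgamma(eta))
            + n * eta * math.log(beta)
            + (eta * gamma - 1.0) * sum_log
            - beta * sum_pow)


def find_boundary(x, gamma=1.0, eta=1.0, penalty_weight=1.0,
                  margin=20, eps=1e-12):
    """Return (boundary_index, delta_bic) for the best single split of x.

    boundary_index is None when no split beats the one-segment model,
    i.e. when the best delta_bic is not positive.
    """
    n = len(x)
    if margin < 1 or n < 2 * margin:
        return None, float("-inf")

    # Prefix sums let every candidate split be scored in O(1).
    cum_pow, cum_log = [0.0], [0.0]
    for v in x:
        m = max(abs(v), eps)  # guard against log(0) on exact zeros
        cum_pow.append(cum_pow[-1] + m ** gamma)
        cum_log.append(cum_log[-1] + math.log(m))

    whole = segment_loglik(n, cum_pow[n], cum_log[n], gamma, eta)
    # Splitting adds one free parameter (a second beta), hence 0.5 * log(n).
    penalty = 0.5 * penalty_weight * math.log(n)

    best_t, best_delta = None, float("-inf")
    for t in range(margin, n - margin + 1):
        left = segment_loglik(t, cum_pow[t], cum_log[t], gamma, eta)
        right = segment_loglik(n - t, cum_pow[n] - cum_pow[t],
                               cum_log[n] - cum_log[t], gamma, eta)
        delta = left + right - whole - penalty
        if delta > best_delta:
            best_t, best_delta = t, delta

    return (best_t if best_delta > 0 else None), best_delta


if __name__ == "__main__":
    # Toy signal: low-amplitude samples followed by high-amplitude samples.
    signal = [1.0, -1.0] * 200 + [10.0, -10.0] * 200
    print(find_boundary(signal))
```

Fixing the shape parameters keeps the estimate in closed form, which is one way to keep the computation cheap; fitting γ and η as well would require an iterative estimator and a correspondingly larger BIC penalty.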