Fundamentals of speech recognition
Fundamentals of speech recognition
An architecture and applications for speech-based accessibility systems
IBM Systems Journal
EURASIP Journal on Applied Signal Processing
Fusion of acoustic and tokenization features for speaker recognition
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Hi-index | 0.00 |
This paper presents performance evaluation of voice activity detectors (VAD) by long-term spectral divergence and simple energy-based scheme. Evaluation is made in the terms of false accept (FA) and false reject (FR) errors using four different types of materials, recorded under different transfer channels, scenarios and conditions. Performance of VADs is considered for forensics, speaker recognition and interactive speech dialogue applications. Performance is still far from perfect, but despite the numerous classification errors of the methods tested, especially with noisy data, the methods can be still useful.