Silence/Speech detection method based on set of decision graphs
TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Hi-index | 0.00 |
The ETSI AMR-2 VAD is rigorously evaluated in clean and noisy conditions. The VAD is then simplified and optimized for porting to an ultra low-resource DSP system using a fast oversampled DFT filterbank. The parameters of the low-resource VAD are optimized using two speakers and 6 types of noise at SNRs from -10 to 20 dB. The VAD is then tested by employing sentences from two other speakers and 12 different types of noise. Results show that the low-resource VAD offers a performance comparable to that of the ETSI VAD in both clean and noisy conditions. When deployed on a custom DSP running at a clock speed of 1.28 MHz and consuming less than 1 milliWatt of power, the low-resource VAD uses less than 30% of the available system resources.