A robust voice activity detector for wireless communications using soft computing

Authors:
F. Beritelli; S. Casale; A. Cavallaro
Affiliations:
Ist. di Inf. e Telecommun., Catania Univ.
Venue:
IEEE Journal on Selected Areas in Communications
Year:
2006

Cited by (13):

Hybrid multimode/multirate CS-ACELP speech coding for adaptive voice over IP
Speech Communication

Intrastandard Hybrid Speech Coding for Adaptive IP Telephony
QoS-IP '01 Proceedings of the International Workshop on Quality of Service in Multiservice IP Networks

Voice activity detection based on a family of parametric distributions
Pattern Recognition Letters

Noise estimation using speech/non-speech frame decision and subband spectral tracking
Speech Communication

Speech/nonspeech detection using minimal Walsh basis functions
EURASIP Journal on Audio, Speech, and Music Processing

A semi-continuous state-transition probability HMM-based voice activity detector
EURASIP Journal on Audio, Speech, and Music Processing

Voice activity detection based on statistical models and machine learning approaches
Computer Speech and Language

Holonic multi-agent system model for fuzzy automatic speech/speaker recognition
KES-AMSTA'08 Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applications

Voice activity detection based on using wavelet packet
Digital Signal Processing

Recurrent type-2 fuzzy neural network using Haar wavelet energy and entropy features for speech detection in noisy environments
Expert Systems with Applications: An International Journal

A portable medical system using real-time streaming transport over 3G wireless networks
Journal of Real-Time Image Processing

Fuzzy logic speech/non-speech discrimination for noise robust speech processing
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I

Robust emotional speech classification in the presence of babble noise
International Journal of Speech Technology

Abstract

Discontinuous transmission based on speech/pause detection is a valid way to improve the spectral efficiency of new-generation wireless communication systems. It requires robust voice activity detection (VAD) algorithms, because traditional solutions show a high misclassification rate in the background noise typical of mobile environments. This paper presents a voice activity detection algorithm that is robust to noisy environments thanks to a new methodology adopted for the matching process. More specifically, the proposed VAD is based on a pattern recognition approach in which the matching phase is performed by a set of six fuzzy rules, trained by means of a new hybrid learning tool. A series of objective tests was performed on a large speech database, varying the signal-to-noise ratio (SNR), the type of background noise, and the input signal level. Compared with the VAD standardized by ITU-T in Recommendation G.729 Annex B, the fuzzy VAD achieves, on average, an improvement of about 25% in the reduction of the activity factor and of about 43% in the reduction of the clipping introduced. Informal listening tests also confirm an improvement in perceived speech quality.
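
The two objective figures quoted in the abstract can be illustrated with a minimal sketch. The abstract does not give formal definitions, so the ones below are assumptions: the activity factor is taken as the fraction of frames a VAD marks as speech, and clipping as the fraction of true speech frames a VAD marks as pause. The function names and the frame-level boolean representation are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def activity_factor(decisions: Sequence[bool]) -> float:
    """Fraction of frames marked as speech (assumed definition)."""
    if len(decisions) == 0:
        raise ValueError("decisions must not be empty")
    return sum(decisions) / len(decisions)


def clipping_rate(decisions: Sequence[bool], reference: Sequence[bool]) -> float:
    """Fraction of reference speech frames marked as pause (assumed definition)."""
    if len(decisions) != len(reference):
        raise ValueError("decisions and reference must have the same length")
    speech_frames = sum(reference)
    if speech_frames == 0:
        return 0.0  # no speech in the reference, so nothing can be clipped
    clipped = sum(1 for d, r in zip(decisions, reference) if r and not d)
    return clipped / speech_frames


def relative_reduction(baseline: float, candidate: float) -> float:
    """Relative reduction of a metric with respect to a baseline VAD."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (baseline - candidate) / baseline


if __name__ == "__main__":
    # Toy frame labels, invented for illustration only (True = speech).
    reference = [False, False, True, True, True, True, False, False]
    baseline = [True, False, True, True, False, True, True, True]
    candidate = [False, False, True, True, True, True, False, True]

    for name, vad in (("baseline", baseline), ("candidate", candidate)):
        print(name, activity_factor(vad), clipping_rate(vad, reference))
    print(relative_reduction(activity_factor(baseline), activity_factor(candidate)))
```

Under these assumed definitions, a lower activity factor means less transmitted traffic and a lower clipping rate means fewer lost speech frames, so a reduction in both at once would correspond to the kind of improvement the abstract reports.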