Voice activity detection using higher order statistics

  • Authors:
  • J. M. Górriz;J. Ramírez;J. C. Segura;S. Hornillo

  • Affiliations:
  • Dept. Teoría de la Sen̈al, Telemática y comunicaciones, Facultad de Ciencias, Universidad de Granada, Granada, Spain;Dept. Teoría de la Sen̈al, Telemática y comunicaciones, Facultad de Ciencias, Universidad de Granada, Granada, Spain;Dept. Teoría de la Sen̈al, Telemática y comunicaciones, Facultad de Ciencias, Universidad de Granada, Granada, Spain;Dept. Teoría de la Sen̈al, Telemática y comunicaciones, Facultad de Ciencias, Universidad de Granada, Granada, Spain

  • Venue:
  • IWANN'05 Proceedings of the 8th international conference on Artificial Neural Networks: computational Intelligence and Bioinspired Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order auto-cumulants. This algorithm differs from many others in the way the decision rule is formulated (detection tests) and the domain used in this approach. Clear improvements in speech/non-speech discrimination accuracy demonstrate the effectiveness of the proposed VAD. It is shown that application of statistical detection test leads to a better separation of the speech and noise distributions, thus allowing a more effective discrimination and a tradeoff between complexity and performance. The algorithm also incorporates a previous noise reduction block improving the accuracy in detecting speech and non-speech.