Phase-based dual-microphone robust speech enhancement

Authors:
P. Aarabi;Guangji Shi
Affiliations:
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Ont., Canada;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2004

Citing 0
Cited 8

A robust statistical-based speaker's location detection algorithm in a vehicular environment

EURASIP Journal on Applied Signal Processing
Noise cross PSD estimation using phase information in diffuse noise field

Signal Processing
The use of phase in complex spectrum subtraction for robust speech recognition

Computer Speech and Language
Speech enhancement using combination of dereverberation and noise reduction for robust speech recognition

Proceedings of the Second Symposium on Information and Communication Technology
The TORGO database of acoustic and articulatory speech from speakers with dysarthria

Language Resources and Evaluation
Real and imaginary modulation spectral subtraction for speech enhancement

Speech Communication
A coherence-based noise reduction algorithm for binaural hearing aids

Speech Communication
Noise variance estimation based on dual-channel phase difference for speech enhancement

Digital Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

A dual-microphone speech-signal enhancement algorithm, utilizing phase-error based filters that depend only on the phase of the signals, is proposed. This algorithm involves obtaining time-varying, or alternatively, time-frequency (TF), phase-error filters based on prior knowledge regarding the time difference of arrival (TDOA) of the speech source of interest and the phases of the signals recorded by the microphones. It is shown that by masking the TF representation of the speech signals, the noise components are distorted beyond recognition while the speech source of interest maintains its perceptual quality. This is supported by digit recognition experiments which show a substantial recognition accuracy rate improvement over prior multimicrophone speech enhancement algorithms. For example, for a case with two speakers with a 0.1 s reverberation time, the phase-error based technique results in a 28.9% recognition rate gain over the single channel noisy signal, a gain of 22.0% over superdirective beamforming, and a gain of 8.5% over postfiltering.