Fast communication: Design of multichannel frequency domain statistical-based enhancement systems preserving spatial cues via spectral distances minimization

  • Authors:
  • F. MustièRe;M. Bouchard;H. Najaf-Zadeh;R. Pichevar;L. Thibault;H. Saruwatari

  • Affiliations:
  • Communications Research Centre, 3701 Carling Avenue, Ottawa, Ontario, Canada K2H 8S2 and School of Electrical Engineering and Computer Science, 800 King Edward Avenue, Ottawa, Ontario, Canada K1N ...;School of Electrical Engineering and Computer Science, 800 King Edward Avenue, Ottawa, Ontario, Canada K1N 6N5;Communications Research Centre, 3701 Carling Avenue, Ottawa, Ontario, Canada K2H 8S2;Communications Research Centre, 3701 Carling Avenue, Ottawa, Ontario, Canada K2H 8S2;Communications Research Centre, 3701 Carling Avenue, Ottawa, Ontario, Canada K2H 8S2;Graduate School of Information Science, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma-Shi, Nara 630-0192, Japan

  • Venue:
  • Signal Processing
  • Year:
  • 2013

Quantified Score

Hi-index 0.08

Visualization

Abstract

It is often very important for multichannel speech enhancement systems, such as hearing aids, to preserve spatial impressions. Usually, this is achieved by first designing a particular speech enhancement algorithm and later or separately constraining the obtained solution to respect spatial cues. Instead, we propose in this paper to conduct the entire system's design via the minimization of statistical spectral distances seen as functions of a real-valued, common gain to be applied to all channels in the frequency-domain. For various spectral distances, we show that the gain derived is expressible in terms of optimal multichannel spectral amplitude estimators (such as the multichannel Minimum Mean Squared Error Spectral Amplitude Estimator, among others). In addition, we report experimental results in complex environments (i.e., including reverberation, interfering talkers, and low signal-to-noise ratio), showing the potential of the proposed methods against recent state-of-the-art multichannel enhancement setups which preserve spatial cues as well.