Consistent wiener filtering: generalized time-frequency masking respecting spectrogram consistency

  • Authors:
  • Jonathan Le Roux;Emmanuel Vincent;Yuu Mizuno;Hirokazu Kameoka;Nobutaka Ono;Shigeki Sagayama

  • Affiliations:
  • NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan;INRIA, Centre Inria Rennes - Bretagne Atlantique, Rennes Cedex, France;Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan;NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan;Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan;Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan

  • Venue:
  • LVA/ICA'10 Proceedings of the 9th international conference on Latent variable analysis and signal separation
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Wiener filtering is one of the most widely used methods in audio source separation. It is often applied on time-frequency representations of signals, such as the short-time Fourier transform (STFT), to exploit their short-term stationarity, but so far the design of the Wiener time-frequency mask did not take into account the necessity for the output spectrograms to be consistent, i.e., to correspond to the STFT of a time-domain signal. In this paper, we generalize the concept of Wiener filtering to time-frequency masks which can involve manipulation of the phase as well by formulating the problem as a consistency-constrained Maximum-Likelihood one. We present two methods to solve the problem, one looking for the optimal time-domain signal, the other promoting consistency through a penalty function directly in the time-frequency domain. We show through experimental evaluation that, both in oracle conditions and combined with spectral subtraction, our method outperforms classical Wiener filtering.