Modeling the Short Time Fourier Transform Ratio and Application to Underdetermined Audio Source Separation

Authors:
Dinh-Tuan Pham;Zaher El-Chami;Alexandre Guérin;Christine Servière
Affiliations:
Laboratory Jean Kuntzmann, CNRS - INPG - UJF Grenoble, France;Orange Labs, Lannion, France;Orange Labs, Lannion, France;GIPSA-lab, CNRS - INPG Grenoble, France
Venue:
ICA '09 Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation
Year:
2009

Citing 1
Cited 0

Blind separation of speech mixtures via time-frequency masking

IEEE Transactions on Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents the theoretical background for the Model Based Underdetermined Source Separation presented in [5]. We show that for a given frequency band, in contrast to customary assumption, the observed Short-Time Fourier Transform (STFT) ratio coming from one source is not constant in time, but is a random variable whose distribution we have obtained. Using this distribution and the Time-Frequency (TF) "disjoint" assumption of sources, we are able to obtain promising results in separating four audio sources from two microphones in a real reverberant room.