Separation of mixed audio signals by source localization and binary masking with Hilbert spectrum

Authors:
Md. Khademul Islam Molla;Keikichi Hirose;Nobuaki Minematsu
Affiliations:
Graduate School of Frontier Sciences;Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan;Graduate School of Frontier Sciences
Venue:
ICA'06: Proceedings of the 6th International Conference on Independent Component Analysis and Blind Signal Separation
Year:
2006

Abstract

The Hilbert transform combined with empirical mode decomposition (EMD) produces the Hilbert spectrum (HS), a fine-resolution time-frequency (TF) representation of any nonlinear and non-stationary signal. This paper presents a method for separating audio signals from stereo mixtures based on the spatial location of the sources. The TF representation of the audio signal is obtained with the HS. The sources are localized in the space of time and intensity differences between the two microphone signals. Separation is performed by masking the target signal in the TF domain, under the assumption that the sources are disjoint orthogonal. The experimental results of the proposed method show a noticeable improvement in separation efficiency.
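
The binary-masking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the two microphone signals are already available as complex TF arrays (a generic stand-in for the Hilbert spectrum), it uses only the intensity difference between channels (the time difference is omitted), and the function name `binary_masks`, the mixing gains 0.5 and 2.0, and the synthetic sources are all hypothetical.

```python
import numpy as np


def binary_masks(X1, X2, source_gains, eps=1e-12):
    """Return one boolean TF mask per source.

    Each TF cell is assigned to the source whose expected inter-channel
    level difference (in dB) is closest to the one observed in that cell.
    Silent cells get an arbitrary label, which is harmless because they
    contribute nothing after masking.
    """
    # Observed level difference between the two channels, per TF cell.
    ild = 20.0 * np.log10((np.abs(X2) + eps) / (np.abs(X1) + eps))
    # Expected level difference for each source (assumed known here).
    targets = 20.0 * np.log10(np.asarray(source_gains, dtype=float))
    # Distance from every cell to every source's expected level difference.
    dist = np.abs(ild[..., None] - targets)
    labels = np.argmin(dist, axis=-1)
    return [labels == k for k in range(len(targets))]


# Synthetic example: two sources with disjoint TF support
# (the disjoint-orthogonality assumption holds exactly here).
rng = np.random.default_rng(0)
shape = (8, 16)  # (frequency bins, time frames), arbitrary sizes
support = rng.random(shape) < 0.5
noise1 = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
noise2 = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
S1 = np.where(support, noise1, 0.0)
S2 = np.where(~support, noise2, 0.0)

# Hypothetical stereo mixing: each source reaches microphone 2
# with a different gain relative to microphone 1.
gains = (0.5, 2.0)
X1 = S1 + S2
X2 = gains[0] * S1 + gains[1] * S2

masks = binary_masks(X1, X2, gains)
S1_est = masks[0] * X1  # masked TF representation of source 1
S2_est = masks[1] * X1  # masked TF representation of source 2
```

Because the synthetic sources never share a TF cell, the masks recover each source exactly in this example. With real recordings the sources overlap in some cells, the expected level differences have to be estimated from the mixtures, and the masked result is only an approximation.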