A multistage approach for blind separation of convolutive speech mixtures

Authors:
Tariqullah Jan; Wenwu Wang; DeLiang Wang
Affiliations:
Centre for Vision, Speech and Signal Processing, University of Surrey, UK; Centre for Vision, Speech and Signal Processing, University of Surrey, UK; Department of Computer Science and Engineering & Centre for Cognitive Science, The Ohio State University, Columbus, USA
Venue:
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Year:
2009

Cited by:
A multistage approach to blind separation of convolutive speech mixtures (Speech Communication)
Analysis of two-sensors forward BSS structure with post-filters in the presence of coherent and incoherent noise (Speech Communication)


Abstract

In this paper, we propose a novel algorithm for separating convolutive speech mixtures from two-microphone recordings. The algorithm combines independent component analysis (ICA) with the ideal binary mask (IBM) and adds a post-filtering process in the cepstral domain. It consists of three steps. First, a constrained convolutive ICA algorithm is applied to separate the source signals from the two-microphone recordings. Second, the IBM is estimated by comparing the energy of corresponding time-frequency (T-F) units of the separated sources obtained with the convolutive ICA algorithm. Third, cepstral smoothing is applied to reduce the musical noise typically caused by T-F masking. The performance of the proposed approach is evaluated on both reverberant mixtures generated with a simulated room model and real recordings. Compared with a recent approach, the proposed algorithm offers considerably higher efficiency and improved speech quality while achieving similar separation performance.
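The second step, estimating binary masks by comparing the energy of corresponding T-F units, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `estimate_binary_masks` and `apply_mask`, the strict greater-than comparison, the tie-breaking rule, and the toy input values are all assumptions made for illustration. The inputs stand in for short-time Fourier transforms of the two ICA outputs; the ICA and cepstral smoothing steps are not shown.

```python
import numpy as np


def estimate_binary_masks(Y1, Y2):
    """Estimate one binary mask per source from two separated spectrograms.

    Y1, Y2: complex arrays of identical shape (frequency bins, time frames),
    assumed to be the T-F representations of the two ICA outputs.
    """
    energy1 = np.abs(Y1) ** 2
    energy2 = np.abs(Y2) ** 2
    # A T-F unit is assigned to source 1 when its energy dominates.
    mask1 = (energy1 > energy2).astype(float)
    # Assumption: ties (equal energy) are assigned to source 2.
    mask2 = 1.0 - mask1
    return mask1, mask2


def apply_mask(mask, Y):
    """Keep only the T-F units selected by the mask."""
    return mask * Y


# Toy 2x2 spectrograms (hypothetical values, not real speech data).
Y1 = np.array([[3 + 0j, 1 + 0j],
               [0.5j, 2 + 0j]])
Y2 = np.array([[1 + 0j, 2 + 0j],
               [1j, 2 + 0j]])

mask1, mask2 = estimate_binary_masks(Y1, Y2)
masked1 = apply_mask(mask1, Y1)
masked2 = apply_mask(mask2, Y2)
```

Because each T-F unit is assigned to exactly one source, the two masks are complementary. The hard 0/1 decisions are what tend to produce musical noise, which is why the abstract follows this step with smoothing in the cepstral domain.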