Exploring permutation inconsistency in blind separation of speech signals in a reverberant environment

Authors:
M. Z. Ikram;D. R. Morgan
Affiliations:
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA;-
Venue:
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Year:
2000

Citing 0
Cited 8

Identification of acoustic MIMO systems: challenges and opportunities

Signal Processing
Fast fixed-point independent vector analysis algorithms for convolutive blind source separation

Signal Processing
Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures

EURASIP Journal on Applied Signal Processing
Geometrical interpretation of the PCA subspace approach for overdetermined blind source separation

EURASIP Journal on Applied Signal Processing
Multichannel eigenspace beamforming in a reverberant noisy environment with multiple interfering speech signals

IEEE Transactions on Audio, Speech, and Language Processing
A Partitioned Frequency Block Algorithm for Blind Separation in Reverberant Environments

Proceedings of the 2009 conference on Neural Nets WIRN09: Proceedings of the 19th Italian Workshop on Neural Nets, Vietri sul Mare, Salerno, Italy, May 28--30 2009
A Pre-Filtering and Post-Filtering Approach to Blind Source Separation

Proceedings of the 2011 conference on Neural Nets WIRN10: Proceedings of the 20th Italian Workshop on Neural Nets
Post-processing for enhancing target signal in frequency domain blind source separation

MRCS'06 Proceedings of the 2006 international conference on Multimedia Content Representation, Classification and Security

Quantified Score

Hi-index	0.00

Visualization

Abstract

We study and explore the limitations of methods for blind separation of a mixture of multiple speakers in a real reverberant environment. To support our results, we analyze a frequency-domain method, which achieves blind source separation (BSS) by transforming the time-domain convolutive problem to multiple short-term problems in the frequency domain. We show that treating the problem independently at different frequency bins introduces a "permutation inconsistency" problem, which becomes worse as the length of room impulse response increases. Our studies prove that the ideas proposed in the existing literature are not capable of effectively handling this problem and a need exists for its satisfactory solution. We speculate that time-domain BSS techniques may also suffer from an equivalent permutation inconsistency problem when long un-mixing filters are used.