Fundamental limitation of frequency domain blind source separation for convolutive mixture of speech

Authors:
S. Araki;S. Makino;T. Nishikawa;H. Saruwatari
Affiliations:
NTT Commun. Sci. Labs., Kyoto, Japan;-;-;-
Venue:
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 05
Year:
2001

Citing 0
Cited 10

Blind source separation in frequency domain

Signal Processing
Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures

EURASIP Journal on Applied Signal Processing
Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Robust independent component analysis using quadratic negentropy

ICA'07 Proceedings of the 7th international conference on Independent component analysis and signal separation
Blind vector deconvolution: convolutive mixture models in short-time fourier transform domain

ICA'07 Proceedings of the 7th international conference on Independent component analysis and signal separation
Multichannel blind separation and deconvolution of images for document analysis

IEEE Transactions on Image Processing
Blind source separation using variable step-size adaptive algorithm in frequency domain

SSIP'05 Proceedings of the 5th WSEAS international conference on Signal, speech and image processing
Post-processing for enhancing target signal in frequency domain blind source separation

MRCS'06 Proceedings of the 2006 international conference on Multimedia Content Representation, Classification and Security
Letters: Blind source separation based on independent vector analysis using feed-forward network

Neurocomputing
Blind Deconvolution of Sources in Fourier Space Based on Generalized Laplace Distribution

International Journal of System Dynamics Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Despite several recent proposals to achieve blind source separation (BSS) for realistic acoustic signals, separation performance is still not good enough. In particular, when the length of impulse response is long, performance is highly limited. We show it is useless to be constrained by the condition, P /spl Lt/ T, where T is the frame size of FFT and P is the length of room impulse response. From our experiments. a frame size of 256 or 512 (32 or 64 ms at a sampling frequency of 8 kHz) is best even for the long room reverberation of T/sub R/ = 150 and 300 ms. We also clarified the reason for poor performance of BSS in a long reverberant environment, finding that separation is achieved chiefly for the sound from the direction of jammers because BSS cannot calculate the inverse of the room transfer function both for the target and jammer signals.