A Classification EM algorithm for clustering and two stochastic versions
Computational Statistics & Data Analysis - Special issue on optimization techniques in statistics
Model-based expectation-maximization source separation and localization
IEEE Transactions on Audio, Speech, and Language Processing
Blind separation of speech mixtures via time-frequency masking
IEEE Transactions on Signal Processing
Performance measurement in blind audio source separation
IEEE Transactions on Audio, Speech, and Language Processing
The cocktail party robot: sound source separation and localisation with an active binaural head
HRI '12 Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction
Hi-index | 0.00 |
We present a method for audio source separation and localization from binaural recordings. The method combines a new generative probabilistic model with time-frequency masking. We suggest that device-dependent relationships between point-source positions and interaural spectral cues may be learnt in order to constrain a mixture model. This allows to capture subtle separation and localization features embedded in the auditory data. We illustrate our method with data composed of two and three mixed speech signals in the presence of reverberations. Using standard evaluation metrics, we compare our method with a recent binaural-based source separation-localization algorithm.