An audio-video based IVA algorithm for source separation and evaluation on the AV16.3 corpus

Authors:
Yanfeng Liang; Jonathon Chambers
Affiliations:
School of Electronic, Electrical and Systems Engineering, Loughborough University, UK; School of Electronic, Electrical and Systems Engineering, Loughborough University, UK
Venue:
LVA/ICA'12 Proceedings of the 10th international conference on Latent Variable Analysis and Signal Separation
Year:
2012

Abstract

The machine cocktail party problem has been researched for several decades. Although many blind source separation schemes have been proposed to address this problem, few of them have been tested on real-room audio-video recordings. In this paper, we propose an audio-video based independent vector analysis (AVIVA) method and compare it with other independent vector analysis (IVA) methods on a real-room recording dataset, the AV16.3 corpus. We also use a new method based on pitch difference detection to evaluate objectively the separation performance of the algorithms on this real dataset; the results confirm the advantages of using the visual modality with IVA.