A watermarking-based method for informed source separation of audio signals with a single sensor

Authors:
Mathieu Parvaix;Laurent Girin;Jean-Marc Brossier
Affiliations:
Grenoble Laboratory of Image, Speech, Signal, and Automation, Grenoble Institute of Technology, Grenoble, France;Grenoble Laboratory of Image, Speech, Signal, and Automation, Grenoble Institute of Technology, Grenoble, France;Grenoble Laboratory of Image, Speech, Signal, and Automation, Grenoble Institute of Technology, Grenoble, France
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2010

Abstract

This paper addresses single-channel audio source separation, i.e., the estimation of several source signals from a single observation of their mixture. This challenging problem is tackled with a specific two-level coder-decoder configuration. At the coder, the source signals are assumed to be available before the mix is processed. Each source signal is characterized by a set of parameters that provide additional information useful for separation. We propose an original method that uses a watermarking technique to imperceptibly embed this information about the source signals into the mix signal. At the decoder, the watermark is extracted from the mix signal, enabling an end user who has no access to the original sources to separate these signals from their mixture. Hence, we call this separation process informed source separation (ISS). Several instrument or voice signals can thereby be segregated from a single piece of music to enable post-mixing processing such as volume control, echo addition, spatialization, or timbre transformation. Good performance is obtained for the separation of up to four source signals from mixtures of speech or music signals. These promising results open up new perspectives in both the under-determined source separation and audio watermarking domains.
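
The coder-decoder idea can be illustrated with a toy sketch. It is not the method of the paper, whose details the abstract does not give. The sketch assumes two sources, one random coefficient vector per source standing in for time-frequency coefficients, one bit of side information per coefficient (which source dominates), and quantization index modulation (QIM) as the embedding scheme. The names `qim_embed` and `qim_extract`, the step `delta`, and the binary-mask separation rule are all hypothetical choices made for illustration.

```python
import numpy as np


def qim_embed(x, bits, delta):
    """Embed one bit per coefficient (assumed scheme: scalar QIM).

    Bit 0 moves the coefficient to an even multiple of delta,
    bit 1 to an odd multiple, so the change is at most delta.
    """
    offset = bits * delta
    return 2 * delta * np.round((x - offset) / (2 * delta)) + offset


def qim_extract(y, delta):
    """Recover the bits from the parity of the nearest multiple of delta."""
    return np.mod(np.round(y / delta).astype(int), 2)


# Coder side: both sources are known before mixing.
rng = np.random.default_rng(0)
s1 = rng.laplace(size=1024)  # stand-in coefficients of source 1
s2 = rng.laplace(size=1024)  # stand-in coefficients of source 2
mix = s1 + s2

# Side information: 1 where source 1 dominates, 0 where source 2 does.
dominant = (np.abs(s1) > np.abs(s2)).astype(int)

delta = 0.01  # hypothetical quantization step, small relative to the signal
marked = qim_embed(mix, dominant, delta)

# Decoder side: only the watermarked mix is available.
decoded = qim_extract(marked, delta)
s1_hat = np.where(decoded == 1, marked, 0.0)  # binary-mask estimate
s2_hat = marked - s1_hat
```

Here the decoder recovers the side information from the watermarked mix alone and assigns each coefficient to the source flagged as dominant. A real system would embed richer source parameters and shape the embedding distortion so that it stays inaudible, which this sketch does not attempt.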