Stereo audio source separation based on time--frequency masking and multilevel thresholding

Authors:
Maximo Cobos;José J. López
Affiliations:
Technical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain;Technical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain
Venue:
Digital Signal Processing
Year:
2008

Citing 5
Cited 1

Blind separation of disjoint orthogonal signals: demixing N sources from 2 mixtures

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 05
Using pitch, amplitude modulation, and spatial cues for separation of harmonic instruments from stereo music recordings

EURASIP Journal on Applied Signal Processing
Blind separation of speech mixtures via time-frequency masking

IEEE Transactions on Signal Processing
Performance measurement in blind audio source separation

IEEE Transactions on Audio, Speech, and Language Processing
Separation of Singing Voice From Music Accompaniment for Monaural Recordings

IEEE Transactions on Audio, Speech, and Language Processing

A Uniform Framework for Ad-Hoc Indexes to Answer Reachability Queries on Large Graphs

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Source separation and up-mixing in real commercial music recordings is a challenging problem. In the last few years, some algorithms have provided interesting results, but the problem remains unsolved. In this paper we describe a method for separating the sources present in a two channel mixture based on the panning coefficients used in the stereo mixdown. The sources are separated by estimating time-frequency masks using the multilevel extension of the Otsu thresholding algorithm used in image segmentation. A refinement step is also carried out for extraction and reassignment of inter-source residuals. Examples of application and performance evaluation are also discussed.