Stereo audio source separation based on time--frequency masking and multilevel thresholding

  • Authors:
  • Maximo Cobos;José J. López

  • Affiliations:
  • Technical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain;Technical University of Valencia, Institute for Telecommunications and Multimedia Applications (iTEAM), Camino de Vera s/n, Valencia, Spain

  • Venue:
  • Digital Signal Processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Source separation and up-mixing in real commercial music recordings is a challenging problem. In the last few years, some algorithms have provided interesting results, but the problem remains unsolved. In this paper we describe a method for separating the sources present in a two channel mixture based on the panning coefficients used in the stereo mixdown. The sources are separated by estimating time-frequency masks using the multilevel extension of the Otsu thresholding algorithm used in image segmentation. A refinement step is also carried out for extraction and reassignment of inter-source residuals. Examples of application and performance evaluation are also discussed.