Audio source separation using hierarchical phase-invariant models

  • Authors:
  • Emmanuel Vincent

  • Affiliations:
  • INRIA, Centre Inria Rennes, Rennes Cedex, France

  • Venue:
  • NOLISP'09 Proceedings of the 2009 international conference on Advances in Nonlinear Speech Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Audio source separation consists of analyzing a given audio recording so as to estimate the signal produced by each sound source for listening or information retrieval purposes. In the last five years, algorithms based on hierarchical phase-invariant models such as single- or multichannel hidden Markov models (HMMs) or nonnegative matrix factorization (NMF) have become popular. In this paper, we provide an overview of these models and discuss their advantages compared to established algorithms such as nongaussianity-based frequency-domain independent component analysis (FDICA) and sparse component analysis (SCA) for the separation of complex mixtures involving many sources or reverberation. We argue how hierarchical phase-invariant modeling could form the basis of future modular source separation systems.