Bayesian Nonparametrics for Microphone Array Processing

  • Authors:
  • Takuma Otsuka;Katsuhiko Ishiguro;Hiroshi Sawada;Hiroshi G. Okuno

  • Affiliations:
  • Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan;NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan;NTT Service Evolution Labs., NTT Corp., Yokosuka, Japan;Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan

  • Venue:
  • IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sound source localization and separation from a mixture of sounds are essential functions for computational auditory scene analysis. The main challenges are designing a unified framework for joint optimization and estimating the sound sources under auditory uncertainties such as reverberation or unknown number of sounds. Since sound source localization and separation are mutually dependent, their simultaneous estimation is required for better and more robust performance. A unified model is presented for sound source localization and separation based on Bayesian nonparametrics. Experiments using simulated and recorded audio mixtures show that a method based on this model achieves state-of-the-art sound source separation quality and has more robust performance on the source number estimation under reverberant environments.