Statistical model of speech signals based on composite autoregressive system with application to blind source separation

  • Authors:
  • Hirokazu Kameoka;Takuya Yoshioka;Mariko Hamamura;Jonathan Le Roux;Kunio Kashino

  • Affiliations:
  • NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan;NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan;NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan;NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan;NTT Communication Science Laboratories, NTT Corporation, Atsugi, Kanagawa, Japan

  • Venue:
  • LVA/ICA'10 Proceedings of the 9th international conference on Latent variable analysis and signal separation
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a new statistical model for speech signals, which consists of a time-invariant dictionary incorporating a set of the power spectral densities of excitation signals and a set of all-pole filters where the gain of each pair of excitation and filter elements is allowed to vary over time. We use this model to develop a combined blind separation and dereverberation method for speech. Reasonably good separations were obtained under a highly reverberant condition.