UBM-based incremental speaker adaptation

  • Authors:
  • Yi-Wen Liu;Lie Lu;Ke Chen;Hong-Jiang Zhang

  • Affiliations:
  • Center for Inf. Sci., Peking Univ., Beijing, China;Center for Comput. Res. in Music & Acousti., Stanford Univ., CA, USA;Inst. for Human-Comput. Commun., Technische Univ. Munchen, Germany;Samsung Electron. Co. Ltd., Suwon, South Korea

  • Venue:
  • ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses a novel algorithm of incremental speaker adaptation (ISA) based on universal background model (UBM) for saving storage and real-time processing. This algorithm can be seen as an extension of traditional speaker adaptation. It consists of two steps, adaptation and combination. It not only considers the speaker's characteristics in limited training data, but also prohibits over-fitting of the updated model. The incremental adaptation algorithm needs little storage and meets the requirement of real-time processing. In order to evaluate the efficiency and effectivity of the proposed approach, a real-time speaker segmentation system for broadcasting news is built. Experiment results demonstrate that our approach yields real time operation and achieves satisfactory performance.