Cross-domain speech disfluency detection

  • Authors:
  • Kallirroi Georgila;Ning Wang;Jonathan Gratch

  • Affiliations:
  • University of Southern California, Playa Vista, CA;University of Southern California, Playa Vista, CA;University of Southern California, Playa Vista, CA

  • Venue:
  • SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We build a model for speech disfluency detection based on conditional random fields (CRFs) using the Switchboard corpus. This model is then applied to a new domain without any adaptation. We show that a technique for detecting speech disfluencies based on Integer Linear Programming (ILP) (Georgila, 2009) significantly outperforms CRFs. In particular, in terms of F-score and NIST Error Rate the absolute improvement of ILP over CRFs exceeds 20% and 25% respectively. We conclude that ILP is an approach with great potential for speech disfluency detection when there is a lack or shortage of indomain data for training.