Dynamic Language Modeling for the European Portuguese

  • Authors:
  • Ciro Martins;António Teixeira;João Neto

  • Affiliations:
  • Department Electronics, Telecommunications & Informatics/IEETA, Aveiro University, and L2F --- Spoken Language Systems Lab, INESC-ID/IST, Lisbon,;Department Electronics, Telecommunications & Informatics/IEETA, Aveiro University,;L2F --- Spoken Language Systems Lab, INESC-ID/IST, Lisbon,

  • Venue:
  • PROPOR '08 Proceedings of the 8th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

Up-to-date language modeling is recognized to be a critical aspect of maintaining the level of performance for a speech recognizer over time for most applications. In particular for applications such as transcription of broadcast news and conversations where the occurrence of new words is very frequent, especially for highly inflected languages like the European Portuguese. An unsupervised adaptation approach, which dynamically adapts the active vocabulary and language model during a multi-pass speech recognition process, is presented. Experimental results confirmed the adequacy of the proposed approaches. Experiments were carried out for a European Portuguese Broadcast News transcription system with the best preliminary results yielding a relative reduction of 65.2% in OOV word rate and 6.6% in WER.