Limited-Vocabulary Estonian Continuous Speech Recognition System using Hidden Markov Models

  • Authors:
  • Tanel Alumäe;Leo Võhandu

  • Affiliations:
  • Tallinn Technical University, Ehitajate tee 5, 19086 Tallinn, Estonia, e-mail: tanel.alumae@phon.ioc.ee, leov@staff.ttu.ee;Tallinn Technical University, Ehitajate tee 5, 19086 Tallinn, Estonia, e-mail: tanel.alumae@phon.ioc.ee, leov@staff.ttu.ee

  • Venue:
  • Informatica
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The article presents a limited-vocabulary speaker independentcontinuous Estonian speech recognition system based on hiddenMarkov models. The system is trained using an annotated Estonianspeech database of 60 speakers, approximately 4 hours in duration.Words are modelled using clustered triphones with multiple Gaussianmixture components. The system is evaluated using a numberrecognition task and a simple medium-vocabulary recognition task.The system performance is explored by employing acoustic models ofincreasing complexity. The number recognizer achieves an accuracyof 97%. The medium-vocabulary system recognizes 82.9% wordscorrectly if operating in real time. The correctness increases to90.6% if real-time requirement is discarded.