Baseline acoustic models for brazilian portuguese using CMU sphinx tools

Authors:
Rafael Oliveira;Pedro Batista;Nelson Neto;Aldebaro Klautau
Affiliations:
Signal Processing Laboratory, Federal University of Pará, Belém, PA, Brazil;Signal Processing Laboratory, Federal University of Pará, Belém, PA, Brazil;Signal Processing Laboratory, Federal University of Pará, Belém, PA, Brazil;Signal Processing Laboratory, Federal University of Pará, Belém, PA, Brazil
Venue:
PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
Year:
2012

Citing 3
Cited 0

Automatic clustering and generation of contextual questions for tied states in hidden Markov models

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
A Comparison between HTK and SPHINX on Chinese Mandarin

JCAI '09 Proceedings of the 2009 International Joint Conference on Artificial Intelligence
A baseline system for continuous speech recognition of Brazilian Prtuguese using the west point Brazilian Portuguese speech corpus

PROPOR'10 Proceedings of the 9th international conference on Computational Processing of the Portuguese Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

Advances in speech processing research rely on the availability of public resources such as corpora, statistical models and baseline systems. In contrast to languages such as English, there are few specific resources for Brazilian Portuguese. This work describes efforts aiming to decrease such gap. Baseline acoustic models for Brazilian Portuguese were built using the CMU Sphinx toolkit and public domain resources: speech corpora, phonetic dictionary and language model. Experiments were carried on for dictation and grammar tasks and the obtained results can be used to support further researches. Part of the trained acoustic models and a reference speech corpus were made publicly available.