Speaker dependent ASRs for huastec and western-huastec náhuatl languages

Authors:
Juan A. Nolazco-Flores;Luis R. Salgado-Garza;Marco Peña-Díaz
Affiliations:
Departamento de Ciencias Computacionales, ITESM, Campus Monterrey, Monterrey, N.L, México;Departamento de Ciencias Computacionales, ITESM, Campus Monterrey, Monterrey, N.L, México;Departamento de Ciencias Computacionales, ITESM, Campus Monterrey, Monterrey, N.L, México
Venue:
IbPRIA'05 Proceedings of the Second Iberian conference on Pattern Recognition and Image Analysis - Volume Part II
Year:
2005

Citing 1
Cited 1

Discrete Time Processing of Speech Signals

Discrete Time Processing of Speech Signals

Phonetic decomposition for speech recognition of lesser-studied languages

Proceedings of the 2009 international workshop on Intercultural collaboration

Quantified Score

Hi-index	0.00

Visualization

Abstract

The purpose of this work is to show the results obtained when the latest technological advances in the area of Automatic Speech Recognition (ASR) are applied to the Western-Huastec Náhuatl and Huastec languages. Western-Huastec Náhuatl and Huastec are not only native (indigenous) languages in México, but also minority languages, and people who speak these languages usually are analphabetic. A speech database was created by recording the voice of native speaker when reading a set of documents used for native bilingual primary school in the official mexican state education system. A pronunciation dictionary was created for each language. A continuous Hidden Markov Models (HMM) were used for acoustical modeling, and bigrams were used for language Modeling. A Viterbi decoder was used for recognition. The word error rate of this task is below 8.621% for Western-Huastec Náhuatl language and 10.154% for Huastec language.