Collecting and evaluating speech recognition corpora for nine Southern Bantu languages

Authors:
Jaco Badenhorst;Charl van Heerden;Marelie Davel;Etienne Barnard
Affiliations:
Meraka Institute, CSIR, South Africa;Meraka Institute, CSIR, South Africa;Meraka Institute, CSIR, South Africa;Meraka Institute, CSIR, South Africa
Venue:
AfLaT '09 Proceedings of the First Workshop on Language Technologies for African Languages
Year:
2009

Citing 6
Cited 1

Introduction to statistical pattern recognition (2nd ed.)

Introduction to statistical pattern recognition (2nd ed.)
Language-independent and language-adaptive acoustic modeling for speech recognition

Speech Communication
Towards language independent acoustic modeling

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Language-dependent state clustering for multilingual acoustic modelling

Speech Communication
Pronunciation prediction with Default&Refine

Computer Speech and Language
HIV health information access using spoken dialogue systems: touchtone vs. speech

ICTD'09 Proceedings of the 3rd international conference on Information and communication technologies and development

Subword variation in text message classification

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which includes data from nine Southern Bantu languages. Because of practical constraints, the amount of speech per language is relatively small compared to major corpora in world languages, and we report on our investigation of the stability of the ASR models derived from the corpus. We also report on phoneme distance measures across languages, and describe initial phone recognisers that were developed using this data.