Multilingual Weighted Codebooks for Non-native Speech Recognition

Authors:
Martin Raab;Rainer Gruhn;Elmar Nöth
Affiliations:
Harman Becker Automotive Systems, Speech Dialog Systems, Ulm, Germany and Dept. of Pattern Recognition, University of Erlangen, Erlangen, Germany;Harman Becker Automotive Systems, Speech Dialog Systems, Ulm, Germany and Dept. of Information Technology, University of Ulm, Ulm, Germany;Dept. of Pattern Recognition, University of Erlangen, Erlangen, Germany
Venue:
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Year:
2008

Citing 2
Cited 1

Multilingual phone models for vocabulary-independent speech recognition tasks

Speech Communication
Language-independent and language-adaptive acoustic modeling for speech recognition

Speech Communication

A scalable architecture for multilingual speech recognition on embedded devices

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many embedded systems commands and other words in the user's main language must be recognized with maximum accuracy, but it should be possible to use foreign names as they frequently occur in music titles or city names. Example systems with constrained resources are navigation systems, mobile phones and MP3 players.Speech recognizers on embedded systems are typically semi-continuous speech recognizers based on vector quantization. Recently we introduced Multilingual Weighted Codebooks (MWCs) for such systems. Our previous work shows significant improvements for the recognition of multiple native languages. However, open questions remained regarding the performance on non-native speech.We evaluate on four different non-native accents of English, and our MWCs produce always significantly better results than a native English codebook. Our best result is a 4.4% absolute word accuracy improvement. Further experiments with non-native accented speech give interesting insights in the attributes of non-native speech in general.