Multilingual speech recognition for information retrieval in Indian context

  • Authors:
  • N. Udhyakumar;R. Swaminathan;S. K. Ramakrishnan

  • Affiliations:
  • Amrita Institute of Technology and Science, Coimbatore, Tamilnadu, India;Amrita Institute of Technology and Science, Coimbatore, Tamilnadu, India;Amrita Institute of Technology and Science, Coimbatore, Tamilnadu, India

  • Venue:
  • HLT-SRWS '04 Proceedings of the Student Research Workshop at HLT-NAACL 2004
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper analyzes various issues in building a HMM based multilingual speech recognizer for Indian languages. The system is originally designed for Hindi and Tamil languages and adapted to incorporate Indian accented English. Language-specific characteristics in speech recognition frame work are highlighted. The recognizer is embedded in information retrieval applications and hence several issues like handling spontaneous telephony speech in real-time, integrated language identification for interactive response and automatic grapheme to phoneme conversion to handle Out Of Vocabulary words are addressed. Experiments to study relative effectiveness of different algorithms have been performed and the results are investigated.