Speech recognition on Mandarin Call Home: a large-vocabulary, conversational, and telephone speech corpus

  • Authors:
  • Fu-Hua Liu;M. Picheny;P. Srinivasa;M. Monkowski;J. Chen

  • Affiliations:
  • Human Language Technol. Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA;Human Language Technol. Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA;Human Language Technol. Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA;Human Language Technol. Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA;Human Language Technol. Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

  • Venue:
  • ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe IBM's most recent efforts for speech recognition on a conversational-speech database, the Mandarin Call Home corpus. While it is similar to the well-known Switchboard corpus, the Call Home task addresses several major challenges in the domain of spoken language systems, including spontaneous dialogue with no pre-specified topics, limited-bandwidth telephone signal, and recognition of other languages than English. We particularly describe the methodology used in Mandarin Call Home corpus to address language-specific issues. We also examine and compare our results with those of the English Switchboard corpus. Preliminary experiments show that a 58.7% character error rate can be achieved in the context of April 95 Mandarin Call Home data set. The experimental results are comparable to those of the state-of-the-art IBM Switchboard system with similar amount of training data.