Influence of background noise and microphone on the performance of the IBM Tangora speech recognition system

  • Authors:
  • Subrata Das;Raimo Bakis;Arthur Nádas;David Nahamoo;Michael Picheny

  • Affiliations:
  • Department of Computer Sciences, IBM Thomas J. Watson Research Center, Yorktown Heights, NY;Department of Computer Sciences, IBM Thomas J. Watson Research Center, Yorktown Heights, NY;Department of Computer Sciences, IBM Thomas J. Watson Research Center, Yorktown Heights, NY;Department of Computer Sciences, IBM Thomas J. Watson Research Center, Yorktown Heights, NY;Department of Computer Sciences, IBM Thomas J. Watson Research Center, Yorktown Heights, NY

  • Venue:
  • ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the intention of developing a robust speech recognizer largely immune to the vagaries of extrinsic changes, we investigated the consequences of various background noises and microphones on the performance of our Tangora system. We identified several noisy locations such as our cafeteria and our secretary's office and included a relatively quiet office for comparison. We recorded isolated-word training and test data from one male and one female speaker at different locations employing several varieties of microphones. A typical experiment consisted of designing a speaker-dependent HMM system with one set of training data and decoding the test data collected at all locations. We found that microphone characteristics had a significant impact on the robustness of our system. Another observation was that controlled contamination of the quiet training data with ambient noise improved the noise immunity of the recognizer, discounting the role of Lombard effect in our studies.