Speech recognition enhancement by lip information

  • Authors:
  • S. Nishida

  • Affiliations:
  • Central Research Lab., Mitsubishi Electric Corp., Tsukaguchi-Hommachi 8-1-1, Amagasaki, Hyogo, 661, Japan and Media Laboratory, MIT, Cambridge, MA

  • Venue:
  • CHI '86 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
  • Year:
  • 1986

Abstract

Although speech recognition technology has progressed recently, Automatic Speech Recognition (ASR) remains vulnerable to noise. Lip information is thought to be useful for speech recognition in noisy situations, such as in a factory or in a car. This paper describes speech recognition enhancement by lip information. Two uses are dealt with. The first, and simplest, is detecting the start and stop of speech from lip information. The second is lip-pattern recognition, which is used for speech recognition together with sound information. Algorithms for both uses are proposed, and an experimental system shows that they work well. The proposed algorithms consist of simple image-processing operations. Future progress in image processing will make it possible to run them in real time.
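The abstract does not give the details of the start/stop detection algorithm, so the following is only a minimal sketch of the general idea and not the paper's method. It assumes that an earlier image-processing step has already reduced each video frame to a single mouth-opening value. The function name `detect_speech_segments`, the `lip_opening` input, and the `threshold` and `min_closed_frames` parameters are all hypothetical.

```python
from typing import List, Tuple


def detect_speech_segments(
    lip_opening: List[float],
    threshold: float = 0.2,
    min_closed_frames: int = 3,
) -> List[Tuple[int, int]]:
    """Return (start, stop) frame ranges in which the lips appear to be moving.

    lip_opening is an assumed per-frame mouth-opening measure (for example,
    a normalised lip height). The stop index is exclusive. A segment ends only
    after the mouth has stayed closed for min_closed_frames frames, so brief
    closures inside a word do not split it.
    """
    segments: List[Tuple[int, int]] = []
    start = None      # first frame of the current segment, if any
    closed_run = 0    # consecutive closed frames seen inside a segment

    for i, value in enumerate(lip_opening):
        if value > threshold:
            if start is None:
                start = i
            closed_run = 0
        elif start is not None:
            closed_run += 1
            if closed_run >= min_closed_frames:
                # Stop just after the last open frame.
                segments.append((start, i - closed_run + 1))
                start = None
                closed_run = 0

    if start is not None:
        # The signal ended while a segment was still open.
        segments.append((start, len(lip_opening) - closed_run))
    return segments


if __name__ == "__main__":
    openings = [0.0, 0.1, 0.5, 0.6, 0.1, 0.7, 0.0, 0.0, 0.0, 0.0, 0.4, 0.0]
    print(detect_speech_segments(openings))
```

In a setup like this, the detected frame ranges could be used to gate the audio passed to the recogniser, so that background noise outside the ranges is ignored. The threshold and closure length would need tuning for a given camera and speaker.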