Integration of acoustic and visual speech signals using neural networks

Authors:
B. P. Yuhas;M. H. Goldstein, Jr.;T. J. Sejnowski
Affiliations:
Johns Hopkins Univ., Baltimore, MD;-;-
Venue:
IEEE Communications Magazine
Year:
1989

Citing 0
Cited 2

Improving connected letter recognition by lipreading

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: plenary, special, audio, underwater acoustics, VLSI, neural networks - Volume I
Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments

The Visual Computer: International Journal of Computer Graphics

Quantified Score

Hi-index	0.25

Visualization

Abstract

Results from a series of experiments that use neural networks to process the visual speech signals of a male talker are presented. In these preliminary experiments, the results are limited to static images of vowels. It is demonstrated that these networks are able to extract speech information from the visual images and that this information can be used to improve automatic vowel recognition. The structure of speech and its corresponding acoustic and visual signals are reviewed. The specific data that was used in the experiments along with the network architectures and algorithms are described. The results of integrating the visual and auditory signals for vowel recognition in the presence of acoustic noise are presented