Comparison of fixed and variable weight approaches for viseme classification

Authors:
P. Patel;K. Ouazzane
Affiliations:
London Metropolitan University, London, UK;London Metropolitan University, London, UK
Venue:
SIP '07 Proceedings of the Ninth IASTED International Conference on Signal and Image Processing
Year:
2007

Citing 4
Cited 0

Automatic lipreading to enhance speech recognition (speech reading)

Automatic lipreading to enhance speech recognition (speech reading)
Boosted Audio-Visual HMM for Speech Reading

AMFG '03 Proceedings of the IEEE International Workshop on Analysis and Modeling of Faces and Gestures
A support vector machine-based dynamic network for visual speech recognition applications

EURASIP Journal on Applied Signal Processing
Eigenfaces for recognition

Journal of Cognitive Neuroscience

Quantified Score

Hi-index	0.00

Visualization

Abstract

Several researchers have demonstrated that a visual speech reading system is beneficial complement to an audio speech recognition system by using of visual speech cues of the speakers face in noisy environment. However, robust and accurate visual feature extraction and classification are difficult object recognition and classification problems, due to high variation in pose, lighting and dynamic nature of the visemes. In this paper, a novel variable weights approach for classifying visemes is presented and compared with fixed weights based classification approach. Firstly, an approach using fixed significance factors (weights) for various components of visemes including mouth gestures is employed for visemes classification. The approach assumes that all visual features have same significance factor for every phoneme. The second approach is based on the hypothesis that the significance of a visual feature is variable for different phonemes. The efficiency of the variable weights approach is evaluated by comparing its results with fixed weights algorithm findings. The recognition results indicate that the variable weight approach has better performance than the fixed weight approach. The results presented demonstrate a highly accurate viseme classification approach with an average alphabet detection rate of about 36.9%.Furthermore, on average around 53% of alphabets were accurately detected using the viseme classifier described in this study.