Hybrid simulated annealing and its application to optimization of hidden Markov models for visual speech recognition

Authors:
Jong-Seok Lee;Cheol Hoon Park
Affiliations:
Institute of Electrical Engineering, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland and School of Electrical Engineering and Computer Science, KAIST, Daejeon, Korea;School of Electrical Engineering and Computer Science, KAIST, Daejeon, Korea
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on gait analysis
Year:
2010

Abstract

We propose a novel stochastic optimization algorithm, hybrid simulated annealing (SA), to train hidden Markov models (HMMs) for visual speech recognition. In our algorithm, SA is combined with a local optimization operator that substitutes a better solution for the current one, improving both the convergence speed and the quality of solutions. We mathematically prove that the sequence of objective values produced by the algorithm converges in probability to the global optimum. The algorithm is applied to train HMMs that are used as visual speech recognizers. While the popular training method for HMMs, the expectation-maximization algorithm, achieves only local optima in the parameter space, the proposed method can perform global optimization of the HMM parameters and thereby obtain solutions that yield improved recognition performance. The superiority of the proposed algorithm over conventional ones is demonstrated via isolated word recognition experiments.
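The general idea of pairing SA with a local operator that swaps in a better solution can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the local operator, the cooling schedule, or the proposal distribution, so the random-perturbation local search, the logarithmic cooling, the Gaussian proposals, and the Rastrigin toy objective (used here in place of an HMM training criterion) are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math
import random


def rastrigin(x):
    """Toy multimodal objective (assumption); global minimum 0 at the origin."""
    return 10 * len(x) + sum(v * v - 10 * math.cos(2 * math.pi * v) for v in x)


def local_improve(f, x, fx, rng, step=0.05, tries=20):
    """Hypothetical local operator: keep a nearby point only if it is better."""
    for _ in range(tries):
        cand = [v + rng.gauss(0.0, step) for v in x]
        fc = f(cand)
        if fc < fx:
            x, fx = cand, fc
    return x, fx


def hybrid_sa(f, x0, iters=2000, t0=5.0, seed=0):
    """Minimize f starting from x0; returns (best point, best value)."""
    rng = random.Random(seed)
    x, fx = list(x0), f(x0)
    best_x, best_fx = x, fx
    for k in range(1, iters + 1):
        temp = t0 / math.log(k + 1)  # assumed logarithmic cooling schedule
        # SA step: Gaussian proposal with Metropolis acceptance.
        cand = [v + rng.gauss(0.0, 1.0) for v in x]
        fc = f(cand)
        if fc <= fx or rng.random() < math.exp(-(fc - fx) / temp):
            x, fx = cand, fc
        # Local step: substitute a better solution for the current one.
        x, fx = local_improve(f, x, fx, rng)
        if fx < best_fx:
            best_x, best_fx = x, fx
    return best_x, best_fx


if __name__ == "__main__":
    point, value = hybrid_sa(rastrigin, [3.5, -2.5])
    print(point, value)
```

In this sketch the SA step can still accept worse candidates, which lets the search leave a local basin, while the local step only ever lowers the current objective value. For HMM training the objective would instead be a model-fit criterion over the HMM parameters, which the sketch does not attempt to reproduce.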