Lip tracking method for the system of audio-visual Polish speech recognition

  • Authors:
  • Mariusz Kubanek;Janusz Bobulski;Lukasz Adrjanowicz

  • Affiliations:
  • Institute of Computer and Information Science, Czestochowa University of Technology, Czestochowa, Poland;Institute of Computer and Information Science, Czestochowa University of Technology, Czestochowa, Poland;Institute of Computer and Information Science, Czestochowa University of Technology, Czestochowa, Poland

  • Venue:
  • ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part I
  • Year:
  • 2012

Abstract

This paper proposes a lip tracking method for an audio-visual speech recognition system. The presented method consists of a face detector, face tracker, lip detector, lip tracker, and word classifier. In speech recognition systems, the audio signal is often exposed to a large amount of acoustic noise; therefore, researchers are looking for ways to reduce the effect of audio interference on recognition results. Visual speech is one source of information that is not perturbed by the acoustic environment and noise. Analyzing visual speech requires a method of lip tracking. This work presents a method for automatic detection of the outer edges of the lips, which was used to identify individual words in audio-visual speech recognition. The paper also shows how to use visual speech to divide the audio signal into phonemes.
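
The abstract does not say how the visual signal is used to divide the audio, so the following is only a rough sketch of one possible approach, not the authors' method. It assumes a per-frame lip opening height is already available from a lip tracker, marks a boundary wherever the opening changes sharply between consecutive frames, and maps those frame indices to audio sample offsets. The function names, the threshold `min_change`, the frame rate, and the sample rate are all hypothetical.

```python
from typing import List


def visual_boundaries(lip_heights: List[float], min_change: float) -> List[int]:
    """Return video frame indices where the lip opening changes sharply.

    Assumption: a large frame-to-frame change in lip height hints at a
    transition between speech units. This is an illustrative heuristic.
    """
    boundaries = []
    for i in range(1, len(lip_heights)):
        if abs(lip_heights[i] - lip_heights[i - 1]) >= min_change:
            boundaries.append(i)
    return boundaries


def frames_to_samples(frames: List[int], fps: float, sample_rate: int) -> List[int]:
    """Map video frame indices to audio sample offsets."""
    return [round(f * sample_rate / fps) for f in frames]


def split_audio(samples: List[float], cut_points: List[int]) -> List[List[float]]:
    """Split an audio sample list at the given, increasing sample offsets."""
    segments, start = [], 0
    for cut in cut_points:
        # Skip cuts that are out of range or not past the previous cut.
        if start < cut < len(samples):
            segments.append(samples[start:cut])
            start = cut
    segments.append(samples[start:])
    return segments


# Hypothetical input: lip heights in pixels for five video frames.
lip_heights = [2.0, 2.1, 6.0, 6.2, 1.0]
frames = visual_boundaries(lip_heights, min_change=3.0)
cuts = frames_to_samples(frames, fps=25.0, sample_rate=16000)
audio = [0.0] * 3000  # placeholder audio signal
segments = split_audio(audio, cuts)
```

In a real system the boundaries would more likely come from the tracked outer lip contour than from a single height value, and the threshold would need tuning per speaker and camera setup.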