Real-time online multimedia content processing: mobile video optical character recognition and speech synthesizer for the visual impaired

  • Authors:
  • Shi-Yong Neo;Hai-Kiat Goh;Wendy Yen-Ni Ng;Jun-Da Ong;Wilson Pang

  • Affiliations:
  • SOC, Singapore;SOC, Singapore;Ministry of Education, Singapore;Kai Square Pte Ltd, Singapore;Kai Square Pte Ltd, Singapore

  • Venue:
  • Proceedings of the 1st international convention on Rehabilitation engineering & assistive technology: in conjunction with 1st Tan Tock Seng Hospital Neurorehabilitation Meeting
  • Year:
  • 2007

Abstract

One of the common difficulties faced by the visually impaired is the inability to read, which affects their way of life. Existing portable reading devices (using character recognition and speech synthesis) have many limitations and poor accuracy because of their restricted processing power. In this paper, we introduce our robust online multimedia content processing framework to alleviate the limitations of such portable devices. We leverage the high transfer speeds of existing wireless networks to send multimedia information captured on mobile devices to high-end processing servers, which then stream the desired output back to users. Because processing is carried out on the servers, the framework supports more complex processes and thus outperforms standard portable devices in accuracy and functionality. In addition, we describe a new approach that improves optical character recognition (OCR) results by using consecutive video frames for automatic character correction. Experiments using consecutive frames show a 25% improvement in accuracy over traditional OCR using a single image. The application was also trialed by several visually impaired individuals, and the feedback obtained is encouraging.
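
The idea of correcting characters using consecutive video frames can be illustrated with a minimal sketch. The abstract does not describe the actual correction algorithm, so the per-position majority vote below is an assumption made purely for illustration, and the function name `vote_across_frames` and the sample readings are hypothetical. The sketch assumes each frame has already been passed through an OCR engine and that readings of the same length are position-aligned.

```python
from collections import Counter


def vote_across_frames(frame_texts):
    """Combine OCR strings from consecutive frames by per-position majority vote.

    Assumption (not from the paper): readings with the same length are
    aligned character by character; readings of other lengths are dropped.
    """
    if not frame_texts:
        return ""
    # Keep only readings with the most common length so positions line up.
    common_len = Counter(len(t) for t in frame_texts).most_common(1)[0][0]
    aligned = [t for t in frame_texts if len(t) == common_len]
    # For each character position, pick the most frequent character.
    return "".join(
        Counter(chars).most_common(1)[0][0] for chars in zip(*aligned)
    )


# Hypothetical OCR output for the same sign seen in five consecutive frames.
readings = ["EXIT", "EX1T", "FXIT", "EXIT", "EXI"]
print(vote_across_frames(readings))
```

In this sketch, an error that appears in only one frame (such as `1` read in place of `I`) is outvoted by the other frames, which is one plausible way a multi-frame approach could outperform OCR on a single image. A practical system would likely need proper sequence alignment for readings of different lengths.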