Optical character recognition of Gurmukhi script using multiple classifiers

  • Authors:
  • Gurpreet Singh Lehal

  • Affiliations:
  • Punjabi University, Patiala, India

  • Venue:
  • Proceedings of the International Workshop on Multilingual OCR
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a robust and font independent Gurmukhi OCR system, which performs reasonably well on old documents as well. The OCR is based on four classifiers operating in serial and parallel mode. For combining the results of the classifiers operating in parallel mode, a corpus based weighted voting method is used. Combining multiple classifiers in such a way, that their individual weaknesses are compensated while their individual strengths are preserved, results in significantly better performance than what can be achieved with a single classifier. The problem of broken characters, which frequently appear in old documents, has also been tackled using a structural feature based algorithm.