Language independent skew estimation technique based on Gaussian mixture models: a case study on South Indian scripts

  • Authors:
  • V. N. Manjunath Aradhya;Ashok Rao;G. Hemantha Kumar

  • Affiliations:
  • Dept of Studies in Computer Science, University of Mysore, Mysore, India;Dept of Electronics and Communication, S.J. College of Engineering, Mysore, India;Dept of Studies in Computer Science, University of Mysore, Mysore, India

  • Venue:
  • PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

During document scanning, skew is inevitably introduced into the incoming document image. Presence of additional modified characters, which get plugged in as extensions and remain as disjointed protrusions of a main character is really challenging in estimating inclination in skewed documents made up of texts in south Indian languages (Kannada, Telugu, Tamil and Malayalam). In this paper, we present a novel script independent (for south Indian) skew estimation technique based on Gaussian Mixture Models (GMM). The Expectation-Maximization (EM) algorithm is used to learn the mixture of Gaussians. Subsequently the cluster means are subjected to moments to estimate the skew angle. Experiments on printed and handwritten documents corrupted by noise is done. Our method shows significantly improved performance as compared to other existing methods.