A stroke regeneration method for cleaning rule-lines in handwritten document images

  • Authors:
  • Huaigu Cao;Rohit Prasad;Prem Natarajan

  • Affiliations:
  • BBN Technologies, Cambridge, MA;BBN Technologies, Cambridge, MA;BBN Technologies, Cambridge, MA

  • Venue:
  • Proceedings of the International Workshop on Multilingual OCR
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe a rule-line removal algorithm for handwritten document images in this paper. Compared to the existing approaches, our algorithm obtains more scalability to higher-resolution images and thicker rule-lines. Derived from the simple gap-filling methods using line-drawing algorithms, we present a novel approach to regenerating the missing portions of text strokes. Using this approach, the deformed text can be restored to its original shape. We also explore the noise filtering method for binarized document images, in particular by choosing the morphological operator in accordance with the noise power of the input image. Our approach has proven to be effective by experiments on both real and synthetic handwritten document images.