Text - Image Separation in Devanagari Documents

  • Authors:
  • Swapnil Khedekar;Vemulapati Ramanaprasad;Srirangaraj Setlur;Venugopal Govindaraju

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a top-down, projection-profilebased algorithm to separate text blocks from image blocksin a Devanagari document. We use a distinctive feature ofDevanagari text, called Shirorekha (Header Line) to analyzethe pattern produced by Devanagari text in the horizontalprofile. The horizontal profile corresponding to a textblock possesses certain regularity in frequency, orientationand shows spatial cohesion. The algorithm uses these featuresto identify text blocks in a document image containingboth text and graphics.