Segmenting Document Images Using Diagonal White Runs and Vertical Edges

  • Authors:
  • Boulos Waked

  • Affiliations:
  • -

  • Venue:
  • ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abstract: We introduce a technique based on diagonal white runs and vertical edges, that divides a document image into columns and blocks which are subsequently classified as text or graphics. A diagonal white run (drun) is a set of adjacent white pixels that are diagonally connected, and a vertical edge consists of the white area between two consecutive druns. This technique was designed as a layout independent approach. Testing the proposed approach on document images with 14 different types and layouts, written in different languages, shows comparative and promising results.