Text line segmentation for gray scale historical document images

  • Authors:
  • Abedelkadir Asi;Raid Saabni;Jihad El-Sana

  • Affiliations:
  • Ben-Gurion University of the Negev, Beer Sheva, Israel;Tel-Aviv University, Tel-Aviv, Israel and TRDC, Kafr Qarea, Israel;Ben-Gurion University of the Negev, Beer Sheva, Israel

  • Venue:
  • Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
  • Year:
  • 2011

Quantified Score

Hi-index 0.02

Visualization

Abstract

In this paper we present a new approach for text line segmentation that works directly on gray-scale document images. Our algorithm constructs distance transform directly on the gray-scale images, which is used to compute two types of seams: medial seams and separating seams. A medial seam is a chain of pixels that crosses the text area of a text line and a separating seam is a path that passes between two consecutive rows. The medial seam determines a text line and the separating seams define the upper and lower boundaries of the text line. The medial and separating seams propagate according to energy maps, which are defined based on the constructed distance transform. We have performed various experimental results on different datasets and received encouraging results.