A Model-based Line Detection Algorithm in Documents

  • Authors:
  • Yefeng Zheng;Huiping Li;David Doermann

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
  • Year:
  • 2003

Abstract

In this paper we present a novel model based approach to detect severely broken parallel lines in noisy textual documents. It is important to detect and remove these lines so the text can be segmented and recognized. We use Directional Single-Connected Chain, a vectorization based algorithm, to extract the line segments. We then instantiate a parallel line model with three parameters: the skew angle, the vertical line gap, and the vertical translation. A coarse-to-fine approach is used to improve the estimation accuracy. From the model we can incorporate the high level contextual information to enhance detection results even when lines are severely broken. Our experimental results show our method can detect 94% of the lines in our database with 168 noisy Arabic document images.
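
As a rough illustration of the three-parameter parallel line model named in the abstract, the sketch below treats the k-th line as y = x * tan(skew) + offset + k * gap and estimates (skew, gap, offset) with a simple coarse-to-fine grid search. This is not the authors' implementation: the Directional Single-Connected Chain extraction step is not reproduced (synthetic points stand in for extracted segment midpoints), and the cost function, the function names (`residual`, `fit_cost`, `grid`, `coarse_to_fine`), and the search settings are all assumptions made for the example.

```python
import math
from itertools import product

def residual(x, y, skew, gap, offset):
    """Vertical distance from (x, y) to the nearest line of the model
    y = x * tan(skew) + offset + k * gap, for integer k."""
    r = (y - x * math.tan(skew) - offset) % gap
    return min(r, gap - r)

def fit_cost(points, skew, gap, offset):
    """Sum of residuals; lower means the model explains the points better."""
    return sum(residual(x, y, skew, gap, offset) for x, y in points)

def grid(lo, hi, steps):
    """Evenly spaced samples from lo to hi inclusive."""
    return [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]

def coarse_to_fine(points, ranges, steps=9, rounds=4):
    """Grid search over (skew, gap, offset), shrinking the search box
    around the best candidate found so far after each round."""
    best, best_cost = None, math.inf
    for _ in range(rounds):
        for cand in product(*(grid(lo, hi, steps) for lo, hi in ranges)):
            cost = fit_cost(points, *cand)
            if cost < best_cost:
                best, best_cost = cand, cost
        # New box: one old grid step on either side of the current best.
        ranges = [(b - (hi - lo) / (steps - 1), b + (hi - lo) / (steps - 1))
                  for b, (lo, hi) in zip(best, ranges)]
    return best, best_cost

# Synthetic stand-in for segment midpoints: 5 lines, skew 0.02 rad,
# gap 40 px, vertical translation 12 px (assumed values, not from the paper).
points = [(x, x * math.tan(0.02) + 12 + 40 * k)
          for k in range(5) for x in range(0, 500, 50)]

# Search ranges for (skew, gap, offset); also assumptions for this example.
(skew, gap, offset), cost = coarse_to_fine(
    points, [(-0.05, 0.05), (30.0, 50.0), (0.0, 50.0)])
print(skew, gap, offset, cost / len(points))
```

Because the model is periodic, a gap of half the true value fits the same points equally well, so the sketch bounds the gap search range away from such submultiples. A real system would presumably weight segments by length and exploit the contextual constraints the abstract mentions.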