Finding the best-fit bounding-boxes

  • Authors:
  • Bo Yuan; Leong Keong Kwoh; Chew Lim Tan

  • Affiliations:
  • Centre for Remote Imaging, Sensing and Processing, National University of Singapore, Singapore; Centre for Remote Imaging, Sensing and Processing, National University of Singapore, Singapore; Department of Computer Science, School of Computing, National University of Singapore, Singapore

  • Venue:
  • DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
  • Year:
  • 2006

Abstract

The bounding-box of a geometric shape in 2D is the rectangle with the smallest area in a given orientation (usually upright) that completely contains the shape. The best-fit bounding-box is the smallest bounding-box among all possible orientations for the same shape. In the context of document image analysis, the shapes can be characters (individual components) or paragraphs (component groups). This paper presents a search algorithm for the best-fit bounding-boxes of textual component groups, whose shapes are customarily rectangular in almost all languages. One application of best-fit bounding-boxes is skew estimation from the text blocks in document images. This approach is capable of multi-skew estimation and location, and can process documents with sparse text regions. The University of Washington English Document Image Database (UW-I) is used to verify the skew estimation method directly and the proposed best-fit bounding-box algorithm indirectly.
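
The definition above can be illustrated with a minimal sketch. It is not the search algorithm proposed in the paper, which the abstract does not describe; it is a plain exhaustive scan over candidate orientations, included only to make the notion of "smallest bounding-box among all orientations" concrete. The function names, the 0.5 degree step, and the [-45, 45) scan range are assumptions made for this illustration. The 90 degree range relies on the fact that rotating a box by 90 degrees swaps its sides without changing its area.

```python
import math

def bounding_box_area(points, angle_deg):
    """Area of the smallest rectangle oriented at angle_deg that contains all points.

    The points are projected onto axes rotated by angle_deg; the extents
    along those two axes are the sides of the enclosing rectangle.
    """
    t = math.radians(angle_deg)
    c, s = math.cos(t), math.sin(t)
    us = [x * c + y * s for x, y in points]    # coordinate along the rotated x-axis
    vs = [-x * s + y * c for x, y in points]   # coordinate along the rotated y-axis
    return (max(us) - min(us)) * (max(vs) - min(vs))

def best_fit_orientation(points, step_deg=0.5):
    """Brute-force scan of [-45, 45) degrees (hypothetical helper, not the paper's method).

    Returns (angle_deg, area) for the orientation with the smallest box area.
    """
    n = int(round(90 / step_deg))
    candidates = (-45 + i * step_deg for i in range(n))
    return min(((a, bounding_box_area(points, a)) for a in candidates),
               key=lambda pair: pair[1])

# Usage: a 100 x 20 "text block" skewed by 10 degrees (synthetic data).
skew = math.radians(10)
corners = [(0, 0), (100, 0), (100, 20), (0, 20)]
block = [(x * math.cos(skew) - y * math.sin(skew),
          x * math.sin(skew) + y * math.cos(skew)) for x, y in corners]
angle, area = best_fit_orientation(block)
print(angle, area)
```

In this toy setting the orientation of the smallest box doubles as a skew estimate for the block, which is the application the abstract mentions. A real document image would supply the pixel or component coordinates of each text block in place of the four synthetic corners.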