Understanding Digital Documents Using Gestalt Properties of Isothetic Components

  • Authors:
  • Shyamosree Pal;Partha Bhowmick;Arindam Biswas;Bhargab B. Bhattacharya

  • Affiliations:
  • Indian Institute of Technology,Kharagpur, India;Indian Institute of Technology,Kharagpur, India;Bengal Engineering and Science University,Shibpur,Howrah, India;Indian Statistical Institute,Kolkata, India

  • Venue:
  • International Journal of Digital Library Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces how Gestalt properties can be used for identifying various components in a document image. That the human mind makes a holistic approach to vision rather than a disintegrated approach is shown to be useful for document analysis. Since the major constituent components textual or non-textual in a document page are arranged in a rectilinear fashion, rectilinear/isothetic decomposition of different components are made on a document page. After representing the page as a feature set of its polygonal covers corresponding to the distinct regions of interest, each polygon is iteratively decomposed into the sub-polygons tightly enclosing the corresponding sub-components to capture the overall information as well as the necessary details to the desired level of precision. Subsequently, these components and sub-components are analyzed using Gestalt laws/properties, which have been explained in detail in the context of this work. Text regions, tabular structures, and various graphic objects readily admit some of the Gestalt properties. We have tested our algorithm on several benchmark datasets, and some relevant results have been produced here to demonstrate the effectiveness and elegance of the proposed method.