Encoding of Modified X-Y Trees for Document Classification

  • Authors:
  • Affiliations:
  • Venue:
  • ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abstract: In this paper we describe a method for classifying document images on the basis of their physical layout. The layout is described by means of a hierarchical description, the Modified X-Y tree, that is derived by the classical X-Y tree segmentation algorithm taking into account cuts along lines in addition to cuts along white spaces between blocks. In order to reduce problems due to noise and skew of the input image, the Modified X-Y tree is built on top of regions extracted by a commercial OCR. The tree is afterwards coded into a fixed-size representation that takes into account occurrences of tree-patterns in the tree representing the page. Lastly, this feature vector is fed to an artificial neural network that is trained to classify document images. The system is applied to the classification of documents belonging to Digital Libraries, examples of classes taken into account are "title page", "index", "regular page". Many tests have been carried out on a data-set of more than 600 pages belonging to an on-line Digital Library. These tests allowed us to conclude that the use of MXY trees is advantageous with respect to the classical XY decomposition for this classification task.