Model-Guided Segmentation and Layout Labelling of Document Images Using a Hierarchical Conditional Random Field

  • Authors:
  • Santanu Chaudhury;Megha Jindal;Sumantra Dutta Roy

  • Affiliations:
  • Dept of Electrical Engg, IIT Delhi, Hauz Khas, New Delhi, India 110 016;Dept of Electrical Engg, IIT Delhi, Hauz Khas, New Delhi, India 110 016;Dept of Electrical Engg, IIT Delhi, Hauz Khas, New Delhi, India 110 016

  • Venue:
  • PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
  • Year:
  • 2009

Abstract

We present a model-guided segmentation and document layout extraction scheme based on hierarchical Conditional Random Fields (CRFs, hereafter). Common methods for classifying each pixel of a document image as text, background or image are often noisy and error-prone, and often require post-processing with heuristic methods. The input to the system is a pixel-wise classification produced by a Fisher classifier applied to the outputs of a set of Globally Matched Wavelet (GMW) filters. The system extracts features which encode the contextual information and spatial configurations of a given document image, and learns relations between layout entities using hierarchical CRFs. The hierarchical CRF enables learning at three levels: (1) local features for text, background and image areas; (2) contextual features for further classifying region blocks - title, author block, heading, paragraph, etc.; and (3) a probabilistic layout model for encoding global relations between the above blocks for a particular class of documents. Although the work was motivated by the need for an automated layout analyser and machine translator for technical papers, it can also be used for other applications such as search, indexing and information retrieval.
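
The abstract's starting point, that independent per-pixel labels are noisy and benefit from contextual modelling, can be illustrated with a small sketch. This is not the hierarchical CRF described in the paper. It is a flat, single-level grid model with a Potts pairwise term, decoded with iterated conditional modes (ICM), and the function name, the cost values and the `beta` weight are all assumptions made for illustration.

```python
# Illustrative only: a flat grid CRF with a Potts pairwise term, decoded by ICM.
# This is NOT the paper's hierarchical CRF; every name and number is hypothetical.

LABELS = ("text", "background", "image")


def icm_smooth(unary, beta=1.0, sweeps=5):
    """Smooth a noisy pixel labelling.

    unary[r][c][k] is the cost of giving label k to pixel (r, c), for example
    a negative log-score from a per-pixel classifier. The result is a grid of
    label indices that locally minimises
        sum of unary costs + beta * (number of disagreeing 4-neighbour pairs).
    """
    rows, cols = len(unary), len(unary[0])
    n_labels = len(unary[0][0])

    # Start from the independent per-pixel decision (the noisy input).
    labels = [[min(range(n_labels), key=lambda k: unary[r][c][k])
               for c in range(cols)] for r in range(rows)]

    for _ in range(sweeps):
        changed = False
        for r in range(rows):
            for c in range(cols):
                neighbours = [labels[rr][cc]
                              for rr, cc in ((r - 1, c), (r + 1, c),
                                             (r, c - 1), (r, c + 1))
                              if 0 <= rr < rows and 0 <= cc < cols]

                def energy(k):
                    # Unary cost plus a penalty per disagreeing neighbour.
                    disagreements = sum(1 for n in neighbours if n != k)
                    return unary[r][c][k] + beta * disagreements

                best = min(range(n_labels), key=energy)
                if best != labels[r][c]:
                    labels[r][c] = best
                    changed = True
        if not changed:
            break  # converged to a local minimum
    return labels


# A 3x3 patch of confident "text" pixels with one weakly mislabelled centre.
text_cost = [0.2, 1.0, 1.0]
noisy_cost = [0.6, 1.0, 0.4]  # slightly prefers "image"
patch = [[list(text_cost) for _ in range(3)] for _ in range(3)]
patch[1][1] = list(noisy_cost)

smoothed = icm_smooth(patch, beta=1.0)
print([[LABELS[k] for k in row] for row in smoothed])
```

With `beta=0` the pairwise term vanishes and the centre pixel keeps its isolated "image" label. With a positive `beta` the surrounding text pixels outweigh the weak unary preference. The paper's model goes further by learning such relations at the region-block and layout levels rather than fixing a single smoothing weight.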