Learning to segment document images

  • Authors:
  • K. S. Sesh Kumar;Anoop Namboodiri;C. V. Jawahar

  • Affiliations:
  • Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India;Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India;Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India

  • Venue:
  • PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

A hierarchical framework for document segmentation is proposed as an optimization problem. The model incorporates the dependencies between various levels of the hierarchy unlike traditional document segmentation algorithms. This framework is applied to learn the parameters of the document segmentation algorithm using optimization methods like gradient descent and Q-learning. The novelty of our approach lies in learning the segmentation parameters in the absence of groundtruth.