Decomposing document images by heuristic search

  • Authors:
  • Dashan Gao;Yizhou Wang

  • Affiliations:
  • Dept. of Electrical and Computer Engineering, University of California, San Diego, La Jolla, CA;Palo Alto Research Center, Palo Alto, CA

  • Venue:
  • EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Document decomposition is a basic but crucial step for many document related applications. This paper proposes a novel approach to decompose document images into zones. It first generates overlapping zone hypotheses based on generic visual features. Then, each candidate zone is evaluated quantitatively by a learned generative zone model. We infer the optimal set of non-overlapping zones that covers a given document image by a heuristic search algorithm. The experimental results demonstrate that the proposed method is very robust to document structure variation and noise.