Picture detection in document page images

  • Authors:
  • Patrick Chiu;Francine Chen;Laurent Denoue

  • Affiliations:
  • FX Palo Alto Laboratory, Palo Alto, CA, USA;FX Palo Alto Laboratory, Palo Alto, CA, USA;FX Palo Alto Laboratory, Palo Alto, CA, USA

  • Venue:
  • Proceedings of the 10th ACM symposium on Document engineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a method for picture detection in document page images, which can come from scanned or camera images, or rendered from electronic file formats. Our method uses OCR to separate out the text and applies the Normalized Cuts algorithm to cluster the non-text pixels into picture regions. A refinement step uses the captions found in the OCR text to deduce how many pictures are in a picture region, thereby correcting for under- and over-segmentation. A performance evaluation scheme is applied which takes into account the detection quality and fragmentation quality. We benchmark our method against the ABBYY application on page images from conference papers.