A Comprehensive Image Processing Suite for Book Re-mastering

  • Authors:
  • Jian Fan;Xiaofan Lin;Steven Simske

  • Affiliations:
  • Hewlett-Packard Laboratories;Hewlett-Packard Laboratories;Hewlett-Packard Laboratories

  • Venue:
  • ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
  • Year:
  • 2005

Abstract

Converting paper books into electronic form provides benefits for archiving, distribution and content reuse. However, directly scanned images are usually undesirable for electronic books, and automated content re-mastering is required. In this paper, we describe a comprehensive image processing suite consisting of three major components: 1) image enhancement with deskew, cropping, color correction, contrast enhancement and text sharpening, 2) compound document image compression, and 3) extraction of the TOC (table of contents) and linking. We built a processing pipeline that automatically converts a set of scanned page images into a high-quality, highly compressed e-book in the popular PDF format.