Document Image Coding for Processing and Retrieval

  • Authors:
  • Omid E. Kia;David S. Doermann

  • Affiliations:
  • National Institute of Standards and Technology, Mathematical and Computational Sciences Division, Building 820, Room 365, Gaithersburg, MD 20899;Language and Media Processing Laboratory, Center for Automation Research, University of Maryland, College Park, MD 20742

  • Venue:
  • Journal of VLSI Signal Processing Systems - special issue on multimedia signal processing
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

Document images belong to a unique class of images where theinformation is embedded in the language represented by a series ofsymbols on the page rather than in the visual objectsthemselves. Since these symbols tend to appear repeatedly, adomain-specific image coding strategy can be designed to facilitateenhanced compression and retrieval. In this paper we describe a codingmethodology that not only exploits component-level redundancy toreduce code length but also supports efficient data access. Theapproach identifies and organizes symbol patterns which appearrepeatedly. Similar components are represented by a single prototypestored in a library and the location of each component instance iscoded along with the residual between it and its prototype. Arepresentation is built which provides a natural information indexallowing access to individual components. Compression results arecompetitive and compressed-domain access is superior to competingmethods. Applications to network-related problems have beenconsidered, and show promising results.