Text Extraction and Document Image Segmentation Using Matched Wavelets and MRF Model

  • Authors:
  • S. . Kumar;R. . Gupta;N. . Khanna;S. . Chaudhury;S. D. Joshi

  • Affiliations:
  • IBM India Res. Lab., Delhi;-;-;-;-

  • Venue:
  • IEEE Transactions on Image Processing
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we have proposed a novel scheme for the extraction of textual areas of an image using globally matched wavelet filters. A clustering-based technique has been devised for estimating globally matched wavelet filters using a collection of groundtruth images. We have extended our text extraction scheme for the segmentation of document images into text, background, and picture components (which include graphics and continuous tone images). Multiple, two-class Fisher classifiers have been used for this purpose. We also exploit contextual information by using a Markov random field formulation-based pixel labeling scheme for refinement of the segmentation results. Experimental results have established effectiveness of our approach.