Exploiting Fisher Kernels in Decoding Severely Noisy Document Images

  • Authors:
  • J. Chen;Y. Wang

  • Affiliations:
  • Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, CA 94304-1314, USA;Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, CA 94304-1314, USA

  • Venue:
  • ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Decoding noisy document images is commonly needed in applications such as enterprise content management. Available OCR solutions are still not satisfactory especially on noisy images, and re-trainable systems require difficult and tedious training example preparation. Motivated by this challenging real application, we propose a novel so- lution that organically combines generative template mod- els with discriminative classifiers via RBF Fisher kernel de- rived from a generative model. We show that the new ap- proach is highly accurate in decoding noisy document im- ages, making the system more generalizable to variations in font and degradation, and hence significantly reduces the burden in training example preparation. We also show that as it weights the pixel features by their relevancies, RBF Fisher kernel is more robust, and leads to smaller, faster models by dimensionality reduction.