Automatic Script Identification From Document Images Using Cluster-Based Templates

  • Authors:
  • Judith Hochberg;Patrick Kelly;Timothy Thomas;Lila Kerns

  • Affiliations:
  • -;-;-;-

  • Venue:
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Year:
  • 1997

Quantified Score

Hi-index 0.14

Visualization

Abstract

We describe an automated script identification system for typeset document images. Templates for each script are created by clustering textual symbols from a training set. Symbols from new images are compared to the templates to find the best script. Our current system processes thirteen scripts with minimal preprocessing and high accuracy.