Automatic Detection of Document Script and Orientation

  • Authors:
  • S. J. Lu; C.-L. Tan

  • Affiliations:
  • National University of Singapore, Kent Ridge, 117543, Singapore; National University of Singapore, Kent Ridge, 117543, Singapore

  • Venue:
  • ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
  • Year:
  • 2007

Abstract

This paper presents an identification technique that automatically detects the underlying script and orientation of scanned document images. In the proposed technique, document script and orientation are identified by using the stroke density and distribution, which convert each document image into a document vector. For each script at each orientation, a number of reference document vectors are first constructed. The script and orientation of the query document are then determined according to the similarity between the query document vector and multiple pre-constructed reference document vectors by using the K-nearest neighbor algorithm. Experiments show that the proposed technique is tolerant to document skew and able to detect orientations of documents of different scripts.
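
The pipeline described in the abstract (image to document vector, then a K-nearest neighbor vote over labeled reference vectors) can be illustrated with a minimal Python sketch. The abstract does not specify the actual stroke density and distribution features or the similarity measure, so everything concrete here is an assumption made for illustration: `document_vector` uses a simple per-cell ink fraction over a hypothetical 4x4 grid as a stand-in feature, `knn_classify` uses cosine similarity, and `toy_page` generates synthetic pages rather than real scans.

```python
from collections import Counter

import numpy as np


def document_vector(image, grid=(4, 4)):
    """Stand-in feature (an assumption, not the paper's): the fraction of ink
    pixels in each grid cell, flattened and scaled to unit length."""
    img = np.asarray(image, dtype=float)
    rows, cols = grid
    cells = []
    for band in np.array_split(img, rows, axis=0):
        for cell in np.array_split(band, cols, axis=1):
            cells.append(cell.mean())
    vec = np.array(cells)
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec


def knn_classify(query_vec, references, k=3):
    """Majority vote among the k reference vectors most similar to the query.

    `references` is a list of (vector, (script, orientation)) pairs.
    Similarity is the dot product of unit vectors, i.e. cosine similarity
    (assumed here; the abstract does not name the measure)."""
    ranked = sorted(
        references,
        key=lambda ref: float(np.dot(query_vec, ref[0])),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def toy_page(ink_on_top, seed):
    """Hypothetical synthetic 'page': random ink in the top or bottom half."""
    rng = np.random.default_rng(seed)
    page = np.zeros((40, 40))
    half = page[:20] if ink_on_top else page[20:]
    half[:] = rng.random(half.shape) < 0.5
    return page


# Reference vectors: several per (script, orientation) label.
references = [
    (document_vector(toy_page(top, s)), ("toy-script", 0 if top else 180))
    for top in (True, False)
    for s in range(3)
]

query = document_vector(toy_page(True, seed=99))
print(knn_classify(query, references, k=3))
```

In this sketch each label is a (script, orientation) pair, so a single vote yields both decisions at once, mirroring the abstract's description of reference vectors built "for each script at each orientation". A real system would replace the grid feature with the paper's stroke-based features.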