Script Identification of Document Image Analysis

  • Authors:
  • Juan Cheng;Xijian Ping;Guanwei Zhou;Yang Yang

  • Affiliations:
  • Zhengzhou Information Science and Technology Institute, China;Zhengzhou Information Science and Technology Institute, China;Zhengzhou Information Science and Technology Institute, China;Zhengzhou Information Science and Technology Institute, China

  • Venue:
  • ICICIC '06 Proceedings of the First International Conference on Innovative Computing, Information and Control - Volume 3
  • Year:
  • 2006

Abstract

Script identification prior to OCR is a necessary step in document image analysis. Each script has a distinctive spatial distribution and distinctive visual attributes, which make it possible to distinguish it from other scripts. The key to a script identification algorithm is extracting effective discriminating features. By analyzing visual differences in normalized histogram statistics, Chinese, Japanese, English, and Russian are each distinguished from the others, so that automatic identification of the four scripts is achieved.
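
The abstract does not specify which histograms the authors compute or how they are compared. As a rough illustration of the general idea only, the sketch below builds a normalized histogram of per-column ink counts from a binarized text-line image and assigns the script whose reference histogram is closest in L1 distance. The function names, the choice of column ink counts as the measured quantity, the bin count, and the nearest-reference rule are all assumptions made for this example, not the paper's method.

```python
from typing import Dict, List, Sequence


def column_ink_counts(image: Sequence[Sequence[int]]) -> List[int]:
    """Count foreground (1) pixels in each column of a binary image."""
    return [sum(column) for column in zip(*image)]


def normalized_histogram(values: Sequence[int], bins: int, max_value: int) -> List[float]:
    """Histogram of integers in [0, max_value], scaled so the bins sum to 1."""
    counts = [0] * bins
    for v in values:
        # Map the value to a bin index; clamp so max_value lands in the last bin.
        idx = min(v * bins // (max_value + 1), bins - 1)
        counts[idx] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * bins
    return [c / total for c in counts]


def l1_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Sum of absolute bin-wise differences between two histograms."""
    return sum(abs(a - b) for a, b in zip(p, q))


def nearest_script(feature: Sequence[float], references: Dict[str, Sequence[float]]) -> str:
    """Return the label of the reference histogram closest to the feature.

    Assumption: one hypothetical reference histogram per script, e.g. an
    average over labeled training lines.
    """
    return min(references, key=lambda name: l1_distance(feature, references[name]))


def line_feature(image: Sequence[Sequence[int]], bins: int) -> List[float]:
    """Normalized histogram of column ink counts for one text-line image."""
    height = len(image)
    return normalized_histogram(column_ink_counts(image), bins, height)
```

Normalizing the histogram makes the feature independent of line length, so lines of different widths can be compared directly. In practice, the reference histograms would have to be estimated from labeled samples of each script.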