INFTY: an integrated OCR system for mathematical documents

  • Authors:
  • Masakazu Suzuki; Fumikazu Tamari; Ryoji Fukuda; Seiichi Uchida; Toshihiro Kanahori

  • Affiliations:
  • Kyushu University, Japan; Fukuoka University of Education, Japan; Oita University, Japan; Kyushu University, Japan; Tsukuba College of Technology, Japan

  • Venue:
  • Proceedings of the 2003 ACM Symposium on Document Engineering
  • Year:
  • 2003

Abstract

This paper presents INFTY, an integrated OCR system for mathematical documents. INFTY consists of four procedures: layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. Several novel techniques are used in these procedures to improve recognition performance. Experiments on about 500 pages of mathematical documents showed high character recognition rates for both mathematical expressions and ordinary text, and sufficient performance in the structure analysis of mathematical expressions.
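
The four procedures named in the abstract can be pictured as a simple sequential pipeline. The Python sketch below is only an illustration of that ordering, not INFTY's actual implementation: `PageState`, `make_stage`, `build_pipeline`, and `run_pipeline` are hypothetical names, and each stage is a placeholder that merely records that it ran. Running the stages strictly in sequence is also an assumption; the abstract only lists the procedures.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class PageState:
    """Hypothetical container passed between stages (not from the paper)."""
    image_id: str
    log: List[str] = field(default_factory=list)


Stage = Callable[[PageState], PageState]

# Stage order follows the abstract's list of four procedures.
STAGE_NAMES = [
    "layout analysis",
    "character recognition",
    "structure analysis of mathematical expressions",
    "manual error correction",
]


def make_stage(name: str) -> Stage:
    """Build a placeholder stage that only records that it ran."""
    def stage(state: PageState) -> PageState:
        state.log.append(name)
        return state
    return stage


def build_pipeline() -> List[Stage]:
    """Create one placeholder stage per procedure, in the listed order."""
    return [make_stage(name) for name in STAGE_NAMES]


def run_pipeline(state: PageState, stages: List[Stage]) -> PageState:
    """Apply each stage in turn, feeding its output to the next stage."""
    for stage in stages:
        state = stage(state)
    return state


if __name__ == "__main__":
    result = run_pipeline(PageState("page-001"), build_pipeline())
    for step in result.log:
        print(step)
```

In a real system each placeholder would be replaced by the corresponding recognition or correction component; the sketch only fixes the order in which a page would pass through them.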