Off-line isolated handwritten Thai OCR using island-based projection with n-gram model and hidden Markov models

  • Authors:
  • Thanaruk Theeramunkong;Chainat Wongtapan

  • Affiliations:
  • Information Technology Program, Sirindhorn International Institute of Technology, Thammasat University, Pathumthani 12121, Thailand;Department of Computer Science, Faculty of Science, Payap University, Chiangmai, Thailand

  • Venue:
  • Information Processing and Management: an International Journal - Special issue: An Asian digital libraries perspective
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many traditional works on off-line Thai handwritten character recognition used a set of local features including circles, concavity, endpoints and lines to recognize hand-printed characters. However, in natural handwriting, these local features are often missing due to rough or quick writing, resulting in dramatic reduction of recognition accuracy. Instead of using such local features, this paper presents a method called multi-directional island-based projection to extract global features from handwritten characters. As the recognition model, two statistical approaches, namely interpolated n-gram model (n-gram) and hidden Markov model (HMM), are proposed. The experimental results indicate that the proposed scheme achieves high accuracy in the recognition of naturally-written Thai characters with numerous variations, compared to some common previous feature extraction techniques. Another experiment with English characters also displays quite promising results.