Building compact recognizers of handwritten Chinese characters using precision constrained Gaussian model, minimum classification error training and parameter compression

  • Authors:
  • Yongqiang Wang;Qiang Huo

  • Affiliations:
  • The University of Hong Kong, Department of Computer Science, Hong Kong, China;MSRA, 5/F, Beijing Sigma Center, 49 Zhichun Road, Haidian District, 100190, Beijing, People’s Republic of China

  • Venue:
  • International Journal on Document Analysis and Recognition - Special issue - Selected and extended papers from ICDAR2009
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In our previous work, a so-called precision constrained Gaussian model (PCGM) was proposed for character modeling to design compact recognizers of handwritten Chinese characters. A maximum likelihood training procedure was developed to estimate model parameters from training data. In this paper, we extend the above-mentioned work by using minimum classification error (MCE) training to improve recognition accuracy and using both split vector quantization and scalar quantization techniques to further compress model parameters. Experimental results on a handwritten character recognition task with a vocabulary of 2,965 Kanji characters demonstrate that MCE-trained and compressed PCGM-based classifiers can achieve much higher recognition accuracies than their counterparts based on traditional modified quadratic discriminant function (MQDF) when the footprint of the classifiers has to be made very small, e.g., less than 2 MB.