Minimum Risk Training for Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields

  • Authors:
  • Xiang-Dong Zhou;Feng Tian;Cheng-Lin Liu;Hong-An Wang

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICDAR '13 Proceedings of the 2013 12th International Conference on Document Analysis and Recognition
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Semi-Markov conditional random fields (semi-CRFs) are usually trained with maximum a posteriori (MAP) criterion which adopts the 0/1 cost for measuring the loss of misclassification. In this paper, based on our previous work on handwritten Chinese/Japanese text recognition (HCTR) using semi-CRFs, we propose an alternative parameter learning method by minimizing the risk, in which the misclassification costs are not equal, but different depending on the hypothesis and the ground-truth. The proposed method is lattice-based, i.e., the hypothesis space is the entire lattice on which the semi-CRF is defined. Experimental results on two online handwriting databases: CASIA-OLHWDB and TUAT Kondate demonstrate that minimum-risk training can yield superior string recognition rates compared to MAP training.