Maximum-minimum similarity training for text extraction

  • Authors:
  • Hui Fu;Xiabi Liu;Yunde Jia

  • Affiliations:
  • School of Computer Science and Technology, Beijing Institute of Technology, Beijing, P.R. China;School of Computer Science and Technology, Beijing Institute of Technology, Beijing, P.R. China;School of Computer Science and Technology, Beijing Institute of Technology, Beijing, P.R. China

  • Venue:
  • ICONIP'06 Proceedings of the 13th international conference on Neural information processing - Volume Part III
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, the discriminative training criterion of maximumminimum similarity (MMS) is used to improve the performance of text extraction based on Gaussian mixture modeling of neighbor characters. A recognizer is optimized in the MMS training through maximizing the similarities between observations and models from the same classes, and minimizing those for different classes. Based on this idea, we define the corresponding objective function for text extraction. Through minimizing the objective function by using the gradient descent method, the optimum parameters of our text extraction method are obtained. Compared with the maximum likelihood estimation (MLE) of parameters, the result trained with the MMS method makes the overall performance of text extraction improved greatly. The precision rate decreased little from 94.59% to 93.56%, but the recall rate increased a lot from 80.39% to 98.55%.