Rejection Algorithm for Mis-segmented Characters In Multilingual Document Recognition

  • Authors:
  • Zhengang Chen;Xiaoqing Ding

  • Affiliations:
  • -;-

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In OCR systems the character segmentation algorithmmay generate mis-segmented blocks. Feedbackinformation from character classifier is indispensable toachieve higher character segmentation accuracy. In thispaper a novel rejection algorithm is proposed to identifythese mis-segmented characters more accurately. First,based on confidence evaluation of distance-basedclassifiers, the usual generalized confidence mappingfunction is modified to fit this specific purpose. Second, anovel adaptive thresholding rejection rule is proposed,which is more accurate and flexible. Experiments onChinese, Japanese and Korean document recognitionshowed that new rejection algorithm evidently improvedthe system performance, especially for low-qualityprinted document recognition.