Interfered-Character Recognition by Removing Interfering-Lines and Adjusting Feature Weights

  • Authors:
  • Affiliations:
  • Venue:
  • ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

Characters sometimes overlap with non-textual lines in form documents and these interfered-characters would generally be recognized with poor accuracy. In this paper, we propose a two-step interfering-line removing method. Positions and orientations of interfering-lines are first detected by the Hough transform. Interferingline widths are then determined from projection histograms. An ambiguous area is defined to bound an interfering-line. Black runs in the ambiguous are classified into four types and grouped into run-groups. The directions of hidden character strokes in each run-groups are predicted. Black pixels located in these hidden strokes are regarded as character pixels and the other black pixels are considered as interfering pixels, which will be removed then. Most OCR engines are trained by noninterfered sample characters. In order to recognize interfered-characters, we adjust the feature values by assigning a stability value to each of sub-regions. In our collected 1820 interfered handwritten Chinese characters, the recognition accuracy was 24.02% for interfered-characters and 89. 91 % for characters after removing interfering-lines.