Automatic Separation of Machine-Printed and Hand-Written Text Lines

  • Authors:
  • U. Pal; B. B. Chaudhuri

  • Venue:
  • ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
  • Year:
  • 1999

Abstract

Many types of documents contain intermixed machine-printed and hand-written text. Since the optical character recognition (OCR) methodologies for machine-printed and hand-written text differ, the two types of text must be separated before being fed to their respective OCR systems. In this paper, we present such a separation scheme for both Bangla and Devnagari scripts. The scheme is based on structural and statistical features of machine-printed and hand-written text lines. The classification scheme has an accuracy of about 98.3%.