Zone Identification in the Printed Gujarati Text
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
A robust two level classification algorithm for text localization in documents
ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Hi-index | 0.00 |
There are many types of documents where machine-printed and hand-written texts intermixedly appear. Since the optical character recognition (OCR) methodologies for machine-printed and hand-written texts are different, it is necessary to separate these two types of text before feeding them to the respective OCR systems. In this paper, we present such a scheme for both Bangla and Devnagari. The scheme is based on the structural and statistical features of the machine-printed and hand-written text lines. The classification scheme has an accuracy about 98.3%.