Zone Identification in the Printed Gujarati Text

Authors:
Jignesh Dholakia;Atul Negi;S. Rama Mohan
Affiliations:
The M.S. Univ. of Baroda. Gujarat, INDIA;Sciences, Univ. of Hyderabad, Hyderabad. INDIA;The M. S. Univ. of Baroda. Gujarat, INDIA.
Venue:
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Year:
2005

Citing 5
Cited 2

An OCR System to Read Two Indian Language Scripts: Bangla and Devnagari (Hindi)

ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Automatic Separation of Machine-Printed and Hand-Written Text Lines

ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Gujarati Character Recognition

ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
A Complete OCR for Printed Hindi Text in Devanagari Script

ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Document analysis system

IBM Journal of Research and Development

Gujarati handwritten numeral optical character reorganization through neural network

Pattern Recognition
Use of MKL as symbol classifier for Gujarati character recognition

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Gujarati, is a language from the Indo-Aryan family of languages, used by 50 million people in the western part of India. Gujarati - script used to write the Gujarati language, is a multilevel script, written in three zones: base character zone, upper modifier zone and lower modifier zone. Several characters are discriminated by the specific modifiers, which exist in the upper and lower zones. Hence, detecting the zone boundaries is an important task in the Gujarati OCR. Although the Gujarati script is in some respects related to the Devanagari script, there are certain peculiar differences, which prevent the use of already known techniques for zone boundary detection for scripts such as Bengali, Assamese and Devanagari where mature OCR systems already do exist. There is only one previous documented effort for Gujarati OCR, in which an approach to recognize a small subset of Gujarati alphabet was discussed. The present paper proposes a sophisticated method for accurate zone detection in images of printed Gujarati. It is expected that this approach shall make the way smoother for the design and development of Gujarati OCR systems for complete character sets.