Feature extraction and classification for bilingual script (Gurmukhi and Roman)

Authors:
Renu Dhir
Affiliations:
Dr. B.R. Ambedkar National Institute of Technology, Jalandhar, Punjab, India
Venue:
ACST'07 Proceedings of the third conference on IASTED International Conference: Advances in Computer Science and Technology
Year:
2007

Abstract

The ability to recognize multilingual documents is both novel and useful. It supports many applications, including multilingual access to patent, business, and regulatory information, translation, and keyword finding in document images. The main purpose of our research is to develop a methodology for a single OCR system that processes bilingual documents typed in both Gurmukhi (Punjabi) and Roman (English) scripts. The OCR system will automatically identify the script of each word in the document, invoke the appropriate recognition engine, and recognize that word.
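The word-level flow described in the abstract (identify the script, then hand the word to the matching engine) can be sketched roughly as follows. This is a minimal illustration, not the paper's method: the abstract does not state which features are used, so the sketch assumes a single, commonly cited cue, namely that Gurmukhi words usually carry a horizontal headline in their upper part while Roman words do not. The function names, the thresholds `top_fraction` and `fill_ratio`, and the stub engines are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Binary word image: a list of rows, 1 = ink pixel, 0 = background.
Image = List[List[int]]


def has_headline(word: Image, top_fraction: float = 0.4,
                 fill_ratio: float = 0.8) -> bool:
    """Return True if some row in the upper part of the word is almost fully inked.

    Assumption: a long horizontal run near the top suggests a Gurmukhi headline.
    Both thresholds are illustrative values, not taken from the paper.
    """
    if not word or not word[0]:
        return False
    height, width = len(word), len(word[0])
    top_rows = max(1, int(height * top_fraction))
    return any(sum(row) / width >= fill_ratio for row in word[:top_rows])


def identify_script(word: Image) -> str:
    """Classify a word image as 'gurmukhi' or 'roman' using the headline cue only."""
    return "gurmukhi" if has_headline(word) else "roman"


def recognize_word(word: Image,
                   engines: Dict[str, Callable[[Image], str]]) -> Tuple[str, str]:
    """Identify the script, then invoke the recognition engine registered for it."""
    script = identify_script(word)
    return script, engines[script](word)


# Stub engines standing in for real per-script recognizers (hypothetical).
engines = {
    "gurmukhi": lambda img: "<gurmukhi text>",
    "roman": lambda img: "<roman text>",
}

# Toy 5x6 word images: the first has a full ink row near the top, the second does not.
gurmukhi_like = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 1, 0, 1, 0],
]
roman_like = [
    [1, 0, 0, 1, 0, 0],
    [1, 0, 0, 1, 0, 0],
    [1, 1, 1, 1, 0, 1],
    [1, 0, 0, 1, 0, 1],
    [1, 0, 0, 1, 0, 1],
]

for image in (gurmukhi_like, roman_like):
    print(recognize_word(image, engines))
```

A real system would likely combine several features and a trained classifier; the single-cue rule here only shows where script identification sits relative to the two recognition engines.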