The ability to recognize multilingual documents is both novel and useful. It supports many applications, including multilingual access to patent, business, and regulatory information; translation; and keyword finding in document images. The main purpose of our research is to develop the methodology for a single OCR system that processes bilingual documents typed in both Gurmukhi (Punjabi) and Roman (English) scripts. The system will automatically identify the script of each word in the document and invoke the appropriate recognition engine to recognize that word.
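
The word-level flow described above (identify the script of each word, then hand the word to the matching recognition engine) can be sketched roughly as follows. This is only an illustrative outline, not the system's actual design: `WordImage`, `headline_identifier`, `recognize_document`, the `min_fill` threshold, and the stub engines are all hypothetical names introduced here. The headline test is a toy stand-in that relies on the horizontal top bar common in Gurmukhi words; the abstract does not say which features the real script identifier uses.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class WordImage:
    """Hypothetical container: a binarized word image, 1 = ink, 0 = background."""
    pixels: List[List[int]]


# Both roles are plain callables so any classifier or engine can be plugged in.
ScriptIdentifier = Callable[[WordImage], str]
Recognizer = Callable[[WordImage], str]


def headline_identifier(word: WordImage, min_fill: float = 0.8) -> str:
    """Toy heuristic (an assumption, not the paper's method): label a word
    'gurmukhi' if some row in its top third is almost fully inked, which
    mimics a headline; otherwise label it 'roman'."""
    rows = word.pixels
    top_rows = rows[: max(1, len(rows) // 3)]
    for row in top_rows:
        if row and sum(row) / len(row) >= min_fill:
            return "gurmukhi"
    return "roman"


def recognize_document(
    words: List[WordImage],
    identify_script: ScriptIdentifier,
    engines: Dict[str, Recognizer],
) -> List[Tuple[str, str]]:
    """Identify the script of each word, then dispatch to that script's engine.
    Returns (script, recognized_text) pairs in reading order."""
    results = []
    for word in words:
        script = identify_script(word)
        if script not in engines:
            raise KeyError(f"no recognition engine registered for script {script!r}")
        results.append((script, engines[script](word)))
    return results


# Stub engines standing in for real Gurmukhi and Roman recognizers.
stub_engines: Dict[str, Recognizer] = {
    "gurmukhi": lambda word: "<gurmukhi word>",
    "roman": lambda word: "<roman word>",
}

with_headline = WordImage([[1, 1, 1, 1, 1], [0, 1, 0, 1, 0], [0, 1, 0, 1, 0]])
without_headline = WordImage([[0, 1, 0, 1, 0], [0, 1, 1, 1, 0], [0, 1, 0, 1, 0]])
output = recognize_document(
    [with_headline, without_headline], headline_identifier, stub_engines
)
```

Keeping the script identifier and the engines as interchangeable callables means a better classifier or an additional script could be registered without touching the dispatch loop.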