Abstract: This paper presents a new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions). Given a document image, the algorithm can select the appropriate model. A new wavelet-based tool has been designed to distinguish text from non-text regions and to characterize font sizes. Our model-based analysis scheme uses this tool to identify the logical components of a document image.
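
The abstract does not describe how the wavelet-based tool works, so the following is only an illustrative sketch of the general idea and not the authors' method. It assumes a one-level 2D Haar transform and a simple rule: a block whose detail sub-bands carry a large share of the energy is treated as text-like. The function names `haar_detail_energy` and `is_text_like`, and the threshold of 0.1, are hypothetical choices made for this example.

```python
def haar_detail_energy(block):
    """Fraction of energy in the detail sub-bands of a one-level 2D Haar transform.

    `block` is a list of equal-length rows of grayscale values; both
    dimensions must be even. This is an illustrative measure only.
    """
    rows, cols = len(block), len(block[0])
    if rows % 2 or cols % 2:
        raise ValueError("block dimensions must be even")

    approx = 0.0  # energy in the low-pass (LL) sub-band
    detail = 0.0  # energy in the three high-pass sub-bands
    for r in range(0, rows, 2):
        for c in range(0, cols, 2):
            a, b = block[r][c], block[r][c + 1]
            d, e = block[r + 1][c], block[r + 1][c + 1]
            # Orthonormal Haar coefficients for one 2x2 neighbourhood.
            ll = (a + b + d + e) / 2.0
            lh = (a - b + d - e) / 2.0  # difference between columns
            hl = (a + b - d - e) / 2.0  # difference between rows
            hh = (a - b - d + e) / 2.0  # diagonal difference
            approx += ll * ll
            detail += lh * lh + hl * hl + hh * hh

    total = approx + detail
    return detail / total if total else 0.0


def is_text_like(block, threshold=0.1):
    """Hypothetical decision rule: high detail energy suggests text strokes."""
    return haar_detail_energy(block) > threshold


# A flat background block versus a block of alternating dark/light columns,
# a crude stand-in for dense character strokes.
flat = [[128] * 4 for _ in range(4)]
stripes = [[0, 255, 0, 255] for _ in range(4)]
print(is_text_like(flat), is_text_like(stripes))
```

A real system would apply such a measure over several scales and many blocks; the scale at which the detail energy peaks is one plausible cue for font size, though the abstract does not confirm that this is how the tool characterizes it.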