A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Layout extraction of mixed mode documents
Machine Vision and Applications
The Document Spectrum for Page Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multi-Skew Detection of Indian Script Documents
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Structuralizing digital ink for efficient selection
Proceedings of the 11th international conference on Intelligent user interfaces
Hi-index | 0.00 |
There are many artistic documents where text lines of a single page may have different inclinations (orientations). To enhance the ability of document analysis system, we have to extract text line in multiple orientations. In this paper, we propose a robust technique to detect English text lines of arbitrary orientation in a single document page. We propose here a bottom-up approach where the connected components are at first labelled. They are then clustered into word groups. Text lines of arbitrary orientation are identified from the estimation of these word groups. From an experiment of 3700 text lines, we obtained an accuracy of 98.3% by the proposed method.