The TEXbook
On the complexity of topological sorting
Information Processing Letters
OHSUMED: an interactive retrieval evaluation and new large test collection for research
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
INFORMys: A Flexible Invoice-Like Form-Reader System
IEEE Transactions on Pattern Analysis and Machine Intelligence
Journal of the ACM (JACM)
Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
Machine Learning for Intelligent Processing of Printed Documents
Journal of Intelligent Information Systems - Special issue on methodologies for intelligent information systems
Geometric Structure Analysis of Document Images: A Knowledge-Based Approach
IEEE Transactions on Pattern Analysis and Machine Intelligence
Maintaining knowledge about temporal intervals
Communications of the ACM
The Art of Computer Programming Volumes 1-3 Boxed Set
The Art of Computer Programming Volumes 1-3 Boxed Set
Modern Information Retrieval
The Latex Companion
Automatic Knowledge Acquisition for Spatial Document Interpretation
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
A Document Classification and Extraction System with Learning Ability
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
A Two Level Knowledge Approach for Understanding Documents of a Multi-Class Domain
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Structured representations in a content based image retrieval context
Journal of Visual Communication and Image Representation
Hi-index | 0.00 |
We use a propositional language of qualitative rectangle relations to detect the reading order from document images. To this end, we define the notion of a document encoding rule and we analyze possible formalisms to express document encoding rules such as LaTeX and SGML. Document encoding rules expressed in the propositional language of rectangles are used to build a reading order detector for document images. In order to achieve robusmess and avoid brittleness when applying the system to real life document images, the notion of a thick boundary interpretation for a qualitative relation is introduced. The framework is tested on a collection of heterogeneous document images showing recall rates up to 89%.