We propose an algorithm to separate tables and math-zones from document images. The algorithm relies on the spatial characteristics of tables and math-zones in a document. Two observations drive it. First, tables have distinct columns, so the gaps between fields are substantially larger than the gaps between words in ordinary text lines. Second, in math-zones the characters and symbols are less densely packed than in normal text lines. These deceptively simple observations have led us to design a simple but powerful table and math-zone detection system with low computational cost.
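The two observations can be illustrated with a minimal sketch. It is not the authors' algorithm, only one way the gap and density cues might be turned into a per-line classifier. It assumes each text line is already segmented into horizontal word or symbol boxes `(x0, x1)`. The function names, the `gap_factor` and `density_threshold` values, and the use of the page-wide median gap as the "normal" word gap are all hypothetical choices made for illustration.

```python
from statistics import median

def gaps(boxes):
    """Horizontal gaps between consecutive (x0, x1) boxes, left to right."""
    boxes = sorted(boxes)
    return [max(0, b[0] - a[1]) for a, b in zip(boxes, boxes[1:])]

def density(boxes):
    """Fraction of the line's horizontal span that is covered by boxes."""
    if not boxes:
        return 0.0
    span = max(x1 for _, x1 in boxes) - min(x0 for x0, _ in boxes)
    covered = sum(x1 - x0 for x0, x1 in boxes)
    return covered / span if span > 0 else 1.0

def classify_lines(lines, gap_factor=3.0, density_threshold=0.5):
    """Label each line 'table', 'math', or 'text'.

    Assumed thresholds (not from the paper):
    - a line is a table candidate when even its smallest gap is more than
      gap_factor times the typical (median) gap on the page;
    - otherwise it is a math candidate when its density is below
      density_threshold.
    """
    all_gaps = [g for line in lines for g in gaps(line)]
    typical = median(all_gaps) if all_gaps else 0
    labels = []
    for line in lines:
        line_gaps = gaps(line)
        if line_gaps and typical > 0 and min(line_gaps) > gap_factor * typical:
            labels.append("table")
        elif density(line) < density_threshold:
            labels.append("math")
        else:
            labels.append("text")
    return labels

# Synthetic example: three text lines, one table row, one math line.
text_line = [(0, 40), (50, 90), (100, 140), (150, 190)]   # narrow gaps, dense
table_row = [(0, 40), (100, 140), (200, 240)]             # wide column gaps
math_line = [(0, 10), (25, 35), (50, 60), (75, 85)]       # sparse symbols
page = [text_line, text_line, text_line, table_row, math_line]
labels = classify_lines(page)
```

The table test runs first because a table row is also sparse and would otherwise be mistaken for a math-zone. A page dominated by tables would skew the median gap, so a real system would need a more robust estimate of the normal word spacing.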