Evaluating and optimizing autonomous text classification systems
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
An Experimental Comparison of Range Image Segmentation Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
Representations and Metrics for Off-Line Handwriting Segmentation
IWFHR '02 Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition (IWFHR'02)
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Applying the T-Recs Table Recognition System to the Business Letter Domain
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Three Approaches to "Industrial" Table Spotting
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Why Table Ground-Truthing is Hard
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
A survey of table recognition: Models, observations, transformations, and inferences
International Journal on Document Analysis and Recognition
An Approach towards Benchmarking of Table Structure Recognition Results
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Distance measures for image segmentation evaluation
EURASIP Journal on Applied Signal Processing
New Metrics for Evaluating Performance in Document Analysis Tasks_Application to the Table Case
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
Performance Evaluation and Benchmarking of Six-Page Segmentation Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic table detection in document images
ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
A methodology for evaluating algorithms for table understanding in PDF documents
Proceedings of the 2012 ACM symposium on Document engineering
Document understanding of graphical content in natively digital PDF documents
Proceedings of the 2012 ACM symposium on Document engineering
Hi-index | 0.00 |
Table spotting and structural analysis are just a small fraction of tasks relevant when speaking of table analysis. Today, quite a large number of different approaches facing these tasks have been described in literature or are available as part of commercial OCR systems that claim to deal with tables on the scanned documents and to treat them accordingly. However, the problem of detecting tables is not yet solved at all. Different approaches have different strengths and weak points. Some fail in certain situations or layouts where others perform better. How shall one know, which approach or system is the best for his specific job? The answer to this question raises the demand for an objective comparison of different approaches which address the same task of spotting tables and recognizing their structure. This paper describes our approach towards establishing a complete and publicly available, hence open environment for the benchmarking of table spotting and structural analysis. We provide free access to the ground truthing tool and evaluation mechanism described in this paper, describe the ideas behind and we also provide ground truth for the 547 documents of the UNLV and UW-3 datasets that contain tables. In addition, we applied the quality measures to the results that were generated by the T-Recs system which we developed some years ago and which we started to further advance since a few months.