A Tabular Survey of Automated Table Processing
GREC '99 Selected Papers from the Third International Workshop on Graphics Recognition, Recent Advances
Layout and Language: Preliminary Investigations in Recognizing the Structure of Tables
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Automatically Extracting Ontologically Specified Data from HTML Tables of Unknown Structure
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
Tabular abstraction, editing, and formatting
Tabular abstraction, editing, and formatting
Evaluation of Model-Based Interactive Flower Recognition
ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 2 - Volume 02
Computer assisted visual interactive recognition: caviar
Computer assisted visual interactive recognition: caviar
A survey of table recognition: Models, observations, transformations, and inferences
International Journal on Document Analysis and Recognition
Automating the extraction of data from HTML tables with unknown structure
Data & Knowledge Engineering - Special issue: ER 2002
Towards Ontology Generation from Tables
World Wide Web
The Recognition Strategy Language
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Towards domain-independent information extraction from web tables
Proceedings of the 16th international conference on World Wide Web
Business Specific Online Information Extraction from German Websites
CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Tools for monitoring, visualizing, and refining collections of noisy documents
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
From Tessellations to Table Interpretation
Calculemus '09/MKM '09 Proceedings of the 16th Symposium, 8th International Conference. Held as Part of CICM '09 on Intelligent Computer Mathematics
Multi-character field recognition for Arabic and Chinese handwriting
SACH'06 Proceedings of the 2006 conference on Arabic and Chinese handwriting recognition
Analysis and taxonomy of column header categories for web tables
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Detecting and recognizing tables in spreadsheets
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Interactive conversion of web tables
GREC'09 Proceedings of the 8th international conference on Graphics recognition: achievements, challenges, and evolution
Using ontologies for extracting product features from web pages
ISWC'06 Proceedings of the 5th international conference on The Semantic Web
The HiLeX system for semantic information extraction
Transactions on Large-Scale Data- and Knowledge-Centered Systems V
Automatic transformation of multi-dimensional web tables into data cubes
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Towards generic framework for tabular data extraction and management in documents
Proceedings of the sixth workshop on Ph.D. students in information and knowledge management
Hi-index | 0.00 |
The shift of interest to web tables in HTML and PDF files, coupled with the incorporation of table analysis and conversion routines in commercial desktop document processing software, are likely to turn table recognition into more of a systems than an algorithmic issue. We illustrate the transition by some actual examples of web table conversion. We then suggest that the appropriate target format for table analysis, whether performed by conventional customized programs or by off-the-shelf software, is a representation based on the abstract table introduced by X. Wang in 1996. We show that the Wang model is adequate for some useful tasks that prove elusive for less explicit representations, and outline our plans to develop a semi-automated table processing system to demonstrate this approach. Screen-snaphots of a prototype tool to allow table mark-up in the style of Wang are also presented.