Polynomial factorization: a success story
ISSAC '03 Proceedings of the 2003 international symposium on Symbolic and algebraic computation
Tabular abstraction, editing, and formatting
Tabular abstraction, editing, and formatting
A survey of table recognition: Models, observations, transformations, and inferences
International Journal on Document Analysis and Recognition
Using visual cues for extraction of tabular data from arbitrary HTML documents
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Transforming arbitrary tables into logical form with TARTAR
Data & Knowledge Engineering
Incremental Learning of First Order Logic Theories for the Automatic Annotations of Web Documents
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
From Tessellations to Table Interpretation
Calculemus '09/MKM '09 Proceedings of the 16th Symposium, 8th International Conference. Held as Part of CICM '09 on Intelligent Computer Mathematics
Hi-index | 0.00 |
Automatic interpretation of web tables can enable database-like semantic search over the plethora of information stored in tables on the web. Our table interpretation method presented here converts the two-dimensional hierarchy of table headers, which provides a visual means of assimilating complex data, into a set of strings that is more amenable to algorithmic analysis of table structure. We show that Header Paths, a new purely syntactic representation of visual tables, can be readily transformed ("factored") into several existing representations of structured data, including category trees and relational tables. Detailed examination of over 100 tables reveals what table features require further work.