We describe a prototype system for assigning table cells to their proper place in the logical structure of the table, based on a simple model of table structure combined with a number of measures of cohesion between cells. A framework is presented for examining the effect of particular variables on the performance of the system, and preliminary results are presented showing the effect of cohesion measures based on the simplest domain-independent analyses, with the aim of allowing future comparison with more knowledge-intensive analyses based on Natural Language Processing. These baseline results suggest that very simple string-based cohesion measures are not sufficient to support the extraction of tuples as we require. Future work will pursue more adequate approximations to a notional subtype/supertype definition of the relationship between value cells and label cells.
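To make the idea of a string-based cohesion measure concrete, the following is a minimal sketch of one possible measure, not the measure used in the prototype. It assumes that two cells cohere when their character "shapes" are similar, so that numeric values group together and stand apart from textual labels. The names char_shape and cohesion are hypothetical and chosen only for illustration.

```python
from difflib import SequenceMatcher


def char_shape(cell: str) -> str:
    """Reduce a cell string to a coarse shape signature (illustrative assumption).

    Digits map to '9', uppercase letters to 'A', lowercase letters to 'a';
    every other character is kept as-is. Runs of the same class are collapsed.
    """
    shape = []
    for ch in cell:
        if ch.isdigit():
            cls = "9"
        elif ch.isupper():
            cls = "A"
        elif ch.islower():
            cls = "a"
        else:
            cls = ch
        # Collapse consecutive characters of the same class.
        if not shape or shape[-1] != cls:
            shape.append(cls)
    return "".join(shape)


def cohesion(cell_a: str, cell_b: str) -> float:
    """Score in [0, 1]: similarity of the two cells' shape signatures."""
    return SequenceMatcher(None, char_shape(cell_a), char_shape(cell_b)).ratio()


if __name__ == "__main__":
    # Two value cells share a shape; a label cell and a value cell do not.
    print(cohesion("12.5", "13.0"))
    print(cohesion("Price", "12.5"))
```

A measure of this kind is domain-independent and cheap to compute, but it only sees surface form: it cannot tell that "Price" labels "12.5", which is the sort of label/value relationship the abstract suggests simple string-based measures fail to capture.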