A machine learning based approach for table detection on the web
Proceedings of the 11th international conference on World Wide Web
Fuzzy Segmentation of Characters in Web Images Based on Human Colour Perception
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Identifying Story and Preview Images in News Web Pages
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Colour text segmentation in web images based on human perception
Image and Vision Computing
Vertical Navigation of Layout Adapted Web Documents
World Wide Web
Towards domain-independent information extraction from web tables
Proceedings of the 16th international conference on World Wide Web
WebTables: exploring the power of tables on the web
Proceedings of the VLDB Endowment
Table extraction using spatial reasoning on the CSS2 visual box model
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Detecting tables in Web documents
Engineering Applications of Artificial Intelligence
A fine-grained taxonomy of tables on the web
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Web-scale table census and classification
Proceedings of the fourth ACM international conference on Web search and data mining
An efficient pre-processing method to identify logical components from PDF documents
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Diction based prosody modeling in table-to-speech synthesis
TSD'05 Proceedings of the 8th international conference on Text, Speech and Dialogue
Hi-index | 0.00 |
Abstract: We propose a set of baseline heuristics for identifying genuinely tabular information and news links in HTML documents. A prototype implementation of these heuristics is described for delivering content from news providers' home pages to a narrow-bandwidth device such as a portable digital assistant or cellular phone display. Its evaluation on 75 web-sites is provided, along with a discussion of topics for future research.