This paper presents a method for finding a specification page on the web for a given object (e.g., "Titanic") and its class label (e.g., "film"). A specification page for an object is a web page which gives concise attribute-value information about the object (e.g., "director"-"James Cameron" for "Titanic"). A simple unsupervised method using layout and symbolic decoration cues was applied to a large number of web pages to acquire the class attributes. We used these acquired attributes to select a representative specification page for a given object from the web pages retrieved by a normal search engine. Experimental results revealed that our method greatly outperformed the normal search engine in terms of specification retrieval.
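
The page-selection step described above can be illustrated with a minimal sketch. It is not the paper's actual scoring method, which the abstract does not specify. It assumes that class attributes have already been acquired, and it ranks candidate pages by the fraction of those attributes that appear in the page text. The function names, the coverage score, and the example URLs are all hypothetical.

```python
from typing import Dict, List, Optional


def attribute_coverage(page_text: str, attributes: List[str]) -> float:
    """Fraction of class attributes that occur in the page text.

    Assumption: plain case-insensitive substring matching stands in for
    whatever matching the original method uses.
    """
    if not attributes:
        return 0.0
    text = page_text.lower()
    hits = sum(1 for attribute in attributes if attribute.lower() in text)
    return hits / len(attributes)


def select_specification_page(
    pages: Dict[str, str], attributes: List[str]
) -> Optional[str]:
    """Return the URL of the page with the highest attribute coverage.

    `pages` maps a URL to its text, e.g. the results a search engine
    returned for the object name. Returns None when no pages are given.
    """
    if not pages:
        return None
    return max(pages, key=lambda url: attribute_coverage(pages[url], attributes))


# Hypothetical attributes for the class "film" and two made-up candidate pages.
film_attributes = ["director", "cast", "release date", "running time"]
candidates = {
    "http://example.com/review": "Titanic is a moving film that everyone should see.",
    "http://example.com/spec": (
        "Titanic. Director: James Cameron. Cast: Leonardo DiCaprio, Kate Winslet. "
        "Running time: 194 minutes."
    ),
}
best = select_specification_page(candidates, film_attributes)
```

In this toy input, the second page mentions three of the four attributes and the first mentions none, so the second is selected. A realistic system would also need the attribute-acquisition step and a more robust way to match attributes against page layout.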