Enhanced hypertext categorization using hyperlinks
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Web classification using support vector machine
Proceedings of the 4th international workshop on Web information and data management
Ontology-Based Automatic Classification for the Web Pages: Design, Implementation and Evaluation
WISE '02 Proceedings of the 3rd International Conference on Web Information Systems Engineering
Automatic Classification of Web Pages based on the Concept of Domain Ontology
APSEC '05 Proceedings of the 12th Asia-Pacific Software Engineering Conference
Yago: a core of semantic knowledge
Proceedings of the 16th international conference on World Wide Web
NAGA: harvesting, searching and ranking knowledge
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Concept vector extraction from Wikipedia category network
Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication
Hi-index | 0.00 |
In this paper, we propose a novel method to classify Web pages by using knowledge bases for entity search, which is a kind of typical Web search for information related to a person, location or organization. First, we map a Web page to entities according to the similarities between the page and the entities. Various methods for computing such similarity are applied. For example, we can compute the similarity between a given page and a Wikipedia article describing a certain entity. The frequency of an entity appearing in the page is another factor used in computing the similarity. Second, we construct a directed acyclic graph, named PEC graph, based on the relations among Web pages, entities, and categories, by referring to YAGO, a knowledge base built on Wikipedia and WordNet. Finally, by analyzing the PEC graph, we classify Web pages into categories. The results of some preliminary experiments validate the methods proposed in this paper.