Semantic similarity based ontology cache

Authors:
Bangyong Liang;Jie Tang;Juanzi Li;Kehong Wang
Affiliations:
Knowledge Engineering Group, Department of Computer Science, Tsinghua University, Beijing, China;Knowledge Engineering Group, Department of Computer Science, Tsinghua University, Beijing, China;Knowledge Engineering Group, Department of Computer Science, Tsinghua University, Beijing, China;Knowledge Engineering Group, Department of Computer Science, Tsinghua University, Beijing, China
Venue:
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Year:
2006

Citing 6
Cited 2

A predicate matching algorithm for database rule systems

SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Unparsing RDF/XML

Proceedings of the 11th international conference on World Wide Web
Towards Intelligent Semantic Caching for Web Sources

Journal of Intelligent Information Systems
Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by Its Inventor

Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by Its Inventor
Semantic Data Caching and Replacement

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
A predicate-based caching scheme for client-server database architectures

The VLDB Journal — The International Journal on Very Large Data Bases

An approach to XML path matching

Proceedings of the 9th annual ACM international workshop on Web information and data management
On the Semantics of Trust and Caching in the Semantic Web

ISWC '08 Proceedings of the 7th International Conference on The Semantic Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the issue of ontology caching on semantic web. The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation. Ontology serves as the metadata for defining the information on semantic web. Ontology based semantic information retrieval (semantic retrieval) is becoming more and more important. Many research and industrial works have been made so far on semantic retrieval. Ontology based retrieval improves the performance of search engine and web mining. In semantic retrieval, a great number of accesses to ontologies usually lead the ontology servers to be very low efficient. To address this problem, it is indeed necessary to cache concepts and instances when ontology server is running. Existing caching methods from database community can be used in the ontology cache. However, they are not sufficient for dealing with the problem. In the task of caching in database, usually the most frequently accessed data are cached and the recently less frequently accessed data in the cache are removed from it. Different from that, in ontology base, data are organized as objects and relations between objects. User may request one object, and then request another object according to a relation of that object. He may also possibly request a similar object that has not any relations to the object. Ontology caching should consider more factors and is more difficult. In this paper, ontology caching is formalized as a problem of classification. In this way, ontology caching becomes independent from any specific semantic web application. An approach is proposed by using machine learning methods. When an object (e.g. concept or instance) is requested, we view its similar objects as candidates. A classification model is then used to predict whether each of these candidates should be cached or not. Features in classification models are defined. Experimental results indicate that the proposed methods can significantly outperform the baseline methods for ontology caching. The proposed method has been applied to a research project that is called SWARMS.