Efficient Index Maintenance for Frequently Updated Semantic Data

Authors:
Yan Liang;Haofen Wang;Qiaoling Liu;Thanh Tran;Thomas Penin;Yong Yu
Affiliations:
Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai, China;Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai, China;Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai, China;Institute AIFB, Universität Karlsruhe, Germany;Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai, China;Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai, China
Venue:
ASWC '08 Proceedings of the 3rd Asian Semantic Web Conference on The Semantic Web
Year:
2008

Citing 17
Cited 0

The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
An effective mechanism for index update in structured documents

Proceedings of the eighth international conference on Information and knowledge management
Storing and querying ordered XML using a relational database system

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Indexing and Querying XML Data for Regular Path Expressions

Proceedings of the 27th International Conference on Very Large Data Bases
Fast Incremental Indexing for Full-Text Information Retrieval

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Querying the Semantic Web: A Formal Approach

ISWC '02 Proceedings of the First International Semantic Web Conference on The Semantic Web
A hybrid approach for searching in the semantic web

Proceedings of the 13th international conference on World Wide Web
Swoogle: a search and metadata engine for the semantic web

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Efficient online index maintenance for contiguous inverted lists

Information Processing and Management: an International Journal
Hybrid index maintenance for growing text collections

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Semantic search via XML fragments: a high-precision approach to IR

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Efficient Update of Indexes for Dynamically Changing Web Documents

World Wide Web
ESTER: efficient search on text, entities, and relations

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Just in time indexing for up to the second search

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
LUBM: A benchmark for OWL knowledge base systems

Web Semantics: Science, Services and Agents on the World Wide Web
Sindice.com: weaving the open linked data

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
Semplore: an IR approach to scalable hybrid query of semantic web data

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Nowadays, the demand on querying and searching the Semantic Web is increasing. Some systems have adopted IR (Information Retrieval) approaches to index and search the Semantic Web data due to its capability to handle the Web-scale data and efficiency on query answering. Additionally, the huge volumes of data on the Semantic Web are frequently updated. Thus, it further requires effective update mechanisms for these systems to handle the data change. However, the existing update approaches only focus on document. It still remains a big challenge to update IR index specially designed for semantic data in the form of finer grained structured objects rather than unstructured documents. In this paper, we present a well-designed update mechanism on the IR index for triples. Our approach provides a flexible and effective update mechanism by dividing the index into blocks. It reduces the number of update operations during the insertion of triples. At the same time, it preserves the efficiency on query processing and the capability to handle large scale semantic data. Experimental results show that the index update time is a fraction of that by complete reconstruction w.r.t. the portion of the inserted triples. Moreover, the query response time is not notably affected. Thus, it is capable to make newly arrived semantic data immediately searchable for users.