Analyzing the properties of XML fragments decomposed from the INEX document collection

Authors:
Kenji Hatano;Hiroko Kinutani;Toshiyuki Amagasa;Yasuhiro Mori;Masatoshi Yoshikawa;Shunsuke Uemura
Affiliations:
Nara Institute of Science and Technology, Japan;Ochanomizu University, Japan;Nara Institute of Science and Technology, Japan;Nagoya University, Japan;Nagoya University, Japan;Nara Institute of Science and Technology, Japan
Venue:
INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval
Year:
2004



Abstract

Current keyword-based XML fragment retrieval systems return XML fragments of various granularities as retrieval results. Because the number of such fragments is huge, index construction time and query processing time suffer unless the system can reliably extract only the fragments that are appropriate as answers. In this paper, we propose a method for determining which XML fragments are appropriate for keyword-based XML fragment retrieval, which should help improve the overall performance of XML fragment retrieval systems. The proposed method analyzes statistical information about XML fragments using a technique from the study of the dynamics of terminology in quantitative linguistics. Our keyword-based XML fragment retrieval system runs on a relational database system, and we also briefly explain its implementation.
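
To make the idea of fragment-level statistics concrete, the following is a minimal sketch, not the authors' method. It assumes that every element subtree counts as one XML fragment and that the statistics of interest are the token count and the number of distinct terms per fragment. The function names `fragment_stats` and `keep_fragments` and the `min_tokens` threshold are hypothetical and chosen only for illustration; the abstract does not specify the actual statistics or selection criterion.

```python
import xml.etree.ElementTree as ET


def fragment_stats(xml_text):
    """Return (path, token_count, distinct_terms) for every element subtree.

    Assumption: each element subtree is treated as one XML fragment.
    """
    root = ET.fromstring(xml_text)
    stats = []

    def visit(elem, parent_path):
        path = f"{parent_path}/{elem.tag}"
        # All text inside the subtree, lowercased and split on whitespace.
        tokens = " ".join(elem.itertext()).lower().split()
        stats.append((path, len(tokens), len(set(tokens))))
        for child in elem:
            visit(child, path)

    visit(root, "")
    return stats


def keep_fragments(stats, min_tokens):
    """Hypothetical filter: drop fragments with fewer than min_tokens tokens."""
    return [s for s in stats if s[1] >= min_tokens]


sample = (
    "<article><title>XML retrieval</title>"
    "<sec><p>XML fragment retrieval</p><p>index size</p></sec></article>"
)
stats = fragment_stats(sample)
for path, n_tokens, n_terms in stats:
    print(path, n_tokens, n_terms)
```

Even this toy document yields five fragments from five elements, which illustrates why indexing every granule becomes expensive on a large collection and why pruning inappropriate fragments before indexing can pay off.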