Relevance measures for XML information retrieval

Authors:
Olli Luoma
Affiliations:
Department of Information Technology, University of Turku, FIN-20014, Finland
Venue:
International Journal of Web and Grid Services
Year:
2007

Citing 20
Cited 0

Index structures for structured documents

Proceedings of the first ACM international conference on Digital libraries
Layered index structures in document database systems

Proceedings of the seventh international conference on Information and knowledge management
Querying and ranking XML documents

Journal of the American Society for Information Science and Technology - XML
Structured information retrieval in XML documents

Proceedings of the 2002 ACM symposium on Applied computing
HyREX: hyper-media retrieval engine for XML

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Accelerating XPath location steps

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
BitCube: A Three-Dimensional Bitmap Indexing for XML Documents

Journal of Intelligent Information Systems
M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Relational Databases for Querying XML Documents: Limitations and Opportunities

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Information Retrieval System for XML Documents

DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Length normalization in XML retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Controlling overlap in content-oriented XML retrieval

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Structure and content scoring for XML

VLDB '05 Proceedings of the 31st international conference on Very large data bases
A survey on tree edit distance and related problems

Theoretical Computer Science
The SMART Retrieval System—Experiments in Automatic Document Processing

The SMART Retrieval System—Experiments in Automatic Document Processing
Evaluation in (XML) information retrieval: expected precision-recall with user modelling (EPRUM)

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Modeling nested relationships in XML documents using relational databases

SOFSEM'05 Proceedings of the 31st international conference on Theory and Practice of Computer Science
Implementation of XPath axes in the multi-dimensional approach to indexing XML data

EDBT'04 Proceedings of the 2004 international conference on Current Trends in Database Technology
Supporting XPath axes with relational databases using a proxy index

XSym'05 Proceedings of the Third international conference on Database and XML Technologies
Ranked retrieval of structured documents with the s-term vector space model

INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval

Quantified Score

Hi-index	0.01

Visualization

Abstract

In recent years, a lot of work has been carried out to develop efficient methods for storing and querying XML data. Most of the proposals have approached the subject from the database point of view, i.e., they have primarily aimed at providing exact matching capabilities. The problem can, however, also be addressed as an information-retrieval problem, which obviously introduces some challenges, such as the need for relevance ranking. The vast majority of the previous proposals have based the ranking primarily on content and, furthermore, if structural properties were taken into account, only containment relationships have been considered. In this paper, we focus on ranking the results based on their structural properties and aim at supporting a wide range of structural operations, such as operations based on preceding/following relationships. Our method is based on a fuzzy interpretation of the XPath query language which is also discussed in this paper. Finally, we discuss a relational implementation of our model and present the results of our experiments.