Developing a unified database framework that enables a better understanding of ranked XML retrieval remains a challenge in the XML database field. We propose a logical algebra, named score region algebra, that enables transparent specification of information retrieval (IR) models for XML databases. Transparency is achieved by instantiating various retrieval models through abstract score functions within the algebra operators, while the logical query plan and operator definitions remain unchanged. Our algebra operators model three important aspects of XML retrieval: element relevance score computation, element score propagation, and element score combination. To illustrate the usefulness of our algebra, we instantiate four different, well-known IR scoring models and combine them with different score propagation and combination functions. We implemented the algebra operators in a prototype system on top of a low-level database kernel. The system is evaluated on a collection of IEEE articles in XML format provided by INEX. We argue that state-of-the-art XML IR models can be transparently implemented using our score region algebra framework on top of any low-level physical database engine or existing RDBMS, allowing a more systematic investigation of retrieval model behavior.
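The idea of keeping the query plan fixed while swapping score functions can be pictured with a small sketch. Everything in it is an illustrative assumption rather than the paper's definitions: the `Region` type, the operator names `score_elements`, `combine`, and `propagate`, and the two toy scoring functions (`tf_score`, `binary_score`) are hypothetical stand-ins, not the four IR models the paper instantiates.

```python
from dataclasses import dataclass, replace
from typing import Callable, List


@dataclass(frozen=True)
class Region:
    """A text region given by token positions (hypothetical representation)."""
    start: int
    end: int
    name: str
    score: float = 0.0


def contains(outer: Region, inner: Region) -> bool:
    return outer.start <= inner.start and inner.end <= outer.end


# Abstract score functions: the only parts that change between retrieval models.
ScoreFn = Callable[[Region, List[Region]], float]
CombineFn = Callable[[float, float], float]
PropagateFn = Callable[[List[float]], float]


def tf_score(element: Region, hits: List[Region]) -> float:
    # Toy model (assumption): term occurrences divided by element length.
    return len(hits) / (element.end - element.start + 1)


def binary_score(element: Region, hits: List[Region]) -> float:
    # Toy model (assumption): 1 if the term occurs in the element, else 0.
    return 1.0 if hits else 0.0


def score_elements(elements, terms, score_fn: ScoreFn):
    """Element relevance score computation from contained term regions."""
    return [replace(e, score=score_fn(e, [t for t in terms if contains(e, t)]))
            for e in elements]


def combine(left, right, combine_fn: CombineFn):
    """Element score combination; both lists hold the same elements in order."""
    return [replace(l, score=combine_fn(l.score, r.score))
            for l, r in zip(left, right)]


def propagate(ancestors, scored, propagate_fn: PropagateFn):
    """Element score propagation from scored descendants to their ancestors."""
    return [replace(a, score=propagate_fn([s.score for s in scored if contains(a, s)]))
            for a in ancestors]


def plan(articles, sections, term_a, term_b, score_fn, combine_fn, propagate_fn):
    """One fixed logical plan; only the three score functions vary."""
    a = score_elements(sections, term_a, score_fn)
    b = score_elements(sections, term_b, score_fn)
    return propagate(articles, combine(a, b, combine_fn), propagate_fn)


# Made-up data: one article with two sections and two query terms.
articles = [Region(0, 9, "article")]
sections = [Region(1, 4, "sec"), Region(5, 9, "sec")]
xml = [Region(2, 2, "xml"), Region(3, 3, "xml"), Region(6, 6, "xml")]
algebra = [Region(7, 7, "algebra")]

# Same plan, two different model instantiations.
run_tf = plan(articles, sections, xml, algebra,
              tf_score, lambda x, y: x + y, sum)
run_binary = plan(articles, sections, xml, algebra,
                  binary_score, lambda x, y: x * y,
                  lambda scores: max(scores, default=0.0))
```

In this sketch `plan` never changes; moving from the term-frequency variant to the binary variant only replaces the three function arguments, which is the kind of separation between logical plan and retrieval model that the abstract describes.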