XIRQL: a query language for information retrieval in XML documents
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Querying and ranking XML documents
Journal of the American Society for Information Science and Technology - XML
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Searching XML documents via XML fragments
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Using a compact tree to index and query XML data
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Processing content-oriented XPath queries
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Ctree: a compact tree for indexing XML data
Proceedings of the 6th annual ACM international workshop on Web information and data management
Controlling overlap in content-oriented XML retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Generalized contextualization method for XML information retrieval
Proceedings of the 14th ACM international conference on Information and knowledge management
A methodology for clustering XML documents by structure
Information Systems
Measuring similarity of semi-structured documents with context weights
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Preparing heterogeneous XML for full-text search
ACM Transactions on Information Systems (TOIS)
XML search: languages, INEX and scoring
ACM SIGMOD Record
Efficiently Querying Large XML Data Repositories: A Survey
IEEE Transactions on Knowledge and Data Engineering
An architecture for xml information retrieval in a peer-to-peer environment
Proceedings of the ACM first Ph.D. workshop in CIKM
RRSi: indexing XML data for proximity twig queries
Knowledge and Information Systems
Effective XML content and structure retrieval with relevance ranking
Proceedings of the 18th ACM conference on Information and knowledge management
A methodology for clustering XML documents by structure
Information Systems
Outline wizard: presentation composition and search
Proceedings of the 15th international conference on Intelligent user interfaces
CoXML: a cooperative XML query answering system
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Book search experiments: investigating IR methods for the indexing and retrieval of books
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Retrieving samples from biobanks
ITBAM'10 Proceedings of the First international conference on Information technology in bio- and medical informatics
When a few highly relevant answers are enough
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Relevance feedback for structural query expansion
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Feedback-Driven structural query expansion for ranked retrieval of XML data
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Semantic relevance ranking for XML keyword search
Information Sciences: an International Journal
Flexible retrieval based on the vector space model
INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval
Structural feedback for keyword-based XML retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Hi-index | 0.00 |
Indexing and ranking are two key factors for efficient and effective XML information retrieval. Inappropriate indexing may result in false negatives and false positives, and improper ranking may lead to low precisions. In this paper, we propose a configurable XML information retrieval system, in which users can configure appropriate index types for XML tags and text contents. Based on users' index configurations, the system transforms XML structures into a compact tree representation, Ctree, and indexes XML text contents. To support XML ranking, we propose the concepts of "weighted term frequency" and "inverted element frequency," where the weight of a term depends on its frequency and location within an XML element as well as its popularity among similar elements in an XML dataset. We evaluate the effectiveness of our system through extensive experiments on the INEX 03 dataset and 30 content and structure (CAS) topics. The experimental results reveal that our system has significantly high precision at low recall regions and achieves the highest average precision (0.3309) as compared with 38 official INEX 03 submissions using the strict evaluation metric.