DocBase: the INEX evaluation experience

  • Authors:
  • Sriram Mohan;Arijit Sengupta

  • Affiliations:
  • Computer Science Department, Indiana University, Bloomington, IN;Information Systems Department, Kelley School of Business, Indiana University, Bloomington, IN

  • Venue:
  • INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Can a system designed primarily for the purpose of database-type storage and retrieval be used for information-retrieval tasks? This was one of the questions that led us to participate in the INEX 2004 initiative. DocBase, a prototype database system developed initially for SGML, and adapted to work with XML, was used for the purpose of answering the queries. DocBase uses DSQL, an adaptation of SQL to provide a mechanism for querying XML using existing database and indexing technologies. The INEX evaluation experience was encouraging - although it did show the limitations of database query languages for classic information retrieval tasks, it also demonstrated that several interesting results can be obtained by using database query languages for information retrieval, especially for queries involving both content and structure. Our results demonstrate the adaptability and scalability of a database system for processing IR queries.