Dynamic Element Retrieval in the Wikipedia Collection

Authors:
Carolyn J. Crouch;Donald B. Crouch;Nachiket Kamat;Vikram Malik;Aditya Mone
Affiliations:
Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607
Venue:
Focused Access to XML Documents
Year:
2008

Citing 6
Cited 2

Pivoted document length normalization

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
A vector space model for automatic indexing

Communications of the ACM
Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types

Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types
The SMART Retrieval System—Experiments in Automatic Document Processing

The SMART Retrieval System—Experiments in Automatic Document Processing
Dynamic element retrieval in a structured environment

ACM Transactions on Information Systems (TOIS)
The dynamic retrieval of XML elements

INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval

A methodology for producing improved focused elements

INEX'09 Proceedings of the Focused retrieval and evaluation, and 8th international conference on Initiative for the evaluation of XML retrieval
Contextualization using hyperlinks and internal hierarchical structure of Wikipedia documents

Proceedings of the 21st ACM international conference on Information and knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes the successful adaptation of our methodology for the dynamic retrieval of XML elements to a semi-structured environment. Working with text that contains both tagged and untagged elements presents particular challenges in this context. Our system is based on the Vector Space Model; basic functions are performed using the Smart experimental retrieval system. Dynamic element retrieval requires only a single indexing of the document collection at the level of the basic indexing node (i.e., the paragraph). It returns a rank-ordered list of elements identical to that produced by the same query against an all-element index of the collection. Experimental results are reported for both the 2006 and 2007 Ad-hoc tasks.