Dynamic Element Retrieval in the Wikipedia Collection

  • Authors:
  • Carolyn J. Crouch;Donald B. Crouch;Nachiket Kamat;Vikram Malik;Aditya Mone

  • Affiliations:
  • Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607;Department of Computer Science, University of Minnesota Duluth, Duluth MN 55812, (218) 726-7607

  • Venue:
  • Focused Access to XML Documents
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes the successful adaptation of our methodology for the dynamic retrieval of XML elements to a semi-structured environment. Working with text that contains both tagged and untagged elements presents particular challenges in this context. Our system is based on the Vector Space Model; basic functions are performed using the Smart experimental retrieval system. Dynamic element retrieval requires only a single indexing of the document collection at the level of the basic indexing node (i.e., the paragraph). It returns a rank-ordered list of elements identical to that produced by the same query against an all-element index of the collection. Experimental results are reported for both the 2006 and 2007 Ad-hoc tasks.