The Wikipedia XML Corpus

  • Authors:
  • Ludovic Denoyer;Patrick Gallinari

  • Affiliations:
  • Laboratoire d'Informatique de Paris 6, 8 rue du capitaine Scott, 75015 Paris, ;Laboratoire d'Informatique de Paris 6, 8 rue du capitaine Scott, 75015 Paris,

  • Venue:
  • Comparative Evaluation of XML Information Retrieval Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article presents the general Wikipedia XML Collection developped for Structured Information Retrieval and Structured Machine Learning. This collection has been built from the Wikipedia Enclyclopedia. We detail particularly here which parts of this collection have been used during INEX 2006 for the Ad-hoc track and for the XML Mining track. Note that other tracks of INEX - multimedia track for example - have also been based on this collection.