The Wikipedia XML corpus

  • Authors:
  • Ludovic Denoyer;Patrick Gallinari

  • Affiliations:
  • Laboratoire d'Informatique de Paris, Paris;Laboratoire d'Informatique de Paris, Paris

  • Venue:
  • ACM SIGIR Forum
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Wikipedia is a well known free content, multilingual encyclopedia written collaboratively by contributors around the world. Anybody can edit an article using a wiki markup language that offers a simplified alternative to HTML. This encyclopedia is composed of millions of articles in different languages.