Weblogs as a source for extracting general world knowledge

  • Authors:
  • Jonathan Gordon;Benjamin Van Durme;Lenhart Schubert

  • Affiliations:
  • University of Rochester, Rochester, NY, USA;University of Rochester, Rochester, NY, USA;University of Rochester, Rochester, NY, USA

  • Venue:
  • Proceedings of the fifth international conference on Knowledge capture
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Knowledge extraction (KE) efforts have often used corpora of heavily edited writing and sources written to provide the desired knowledge (e.g., newspapers or textbooks). However, the proliferation of diverse, up-to-date, unedited writing on the Web, especially in weblogs, offers new challenges for KE tools. We describe our efforts to extract general knowledge implicit in this noisy data and examine whether such sources can be an adequate substitute for resources like Wikipedia.