Using natural language to integrate, evaluate, and optimize extracted knowledge bases

  • Authors:
  • Doug Downey;Chandra Sekhar Bhagavatula;Alexander Yates

  • Affiliations:
  • Northwestern University, Evanston, IL, USA;Northwestern University, Evanston, IL, USA;Temple University, Philadelphia, PA, USA

  • Venue:
  • Proceedings of the 2013 workshop on Automated knowledge base construction
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web Information Extraction (WIE) systems extract billions of unique facts, but integrating the assertions into a coherent knowledge base and evaluating across different WIE techniques remains a challenge. We propose a framework that utilizes natural language to integrate and evaluate extracted knowledge bases (KBs). In the framework, KBs are integrated by exchanging probability distributions over natural language, and evaluated by how well the output distributions predict held-out text. We describe the advantages of the approach, and detail remaining research challenges.