Relabeling distantly supervised training data for temporal knowledge base population

  • Authors:
  • Suzanne Tamang;Heng Ji

  • Affiliations:
  • City University of New York, New York, NY;City University of New York, New York, NY

  • Venue:
  • AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We enhance a temporal knowledge base population system to improve the quality of distantly supervised training data and identify a minimal feature set for classification. The approach uses multi-class logistic regression to eliminate individual features based on the strength of their association with a temporal label followed by semi-supervised relabeling using a subset of human annotations and lasso regression. As implemented in this work, our technique improves performance and results in notably less computational cost than a parallel system trained on the full feature set.