Coupling label propagation and constraints for temporal fact extraction

Authors:
Yafang Wang;Maximilian Dylla;Marc Spaniol;Gerhard Weikum
Affiliations:
Max Planck Institute for Informatics, Saarbrücken, Germany;Max Planck Institute for Informatics, Saarbrücken, Germany;Max Planck Institute for Informatics, Saarbrücken, Germany;Max Planck Institute for Informatics, Saarbrücken, Germany
Venue:
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Year:
2012



Abstract

The Web and digitized text sources contain a wealth of information about named entities such as politicians, actors, companies, or cultural landmarks. Extracting this information has enabled the automated construction of large knowledge bases containing hundreds of millions of binary relationships or attribute values about these named entities. In reality, however, most knowledge is transient, i.e., it changes over time, so fact extraction requires a temporal dimension. In this paper we develop a methodology that combines label propagation with constraint reasoning for temporal fact extraction. Label propagation aggressively gathers fact candidates, and an Integer Linear Program is used to clean out false hypotheses that violate temporal constraints. Our method improves recall while maintaining precision, which we demonstrate by experiments with biography-style Wikipedia pages and a large corpus of news articles.
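
To make the constraint-cleaning step more concrete, here is a minimal sketch of how a 0/1 integer program might discard temporally inconsistent fact candidates. It is an illustration under stated assumptions, not the authors' implementation: the candidate facts, their confidence scores, and the single constraint (one subject cannot hold the same relation to two different objects in overlapping intervals) are all hypothetical, and exhaustive enumeration stands in for a real ILP solver because the example is tiny.

```python
from itertools import product

# Hypothetical fact candidates, as label propagation might produce them:
# (subject, relation, object, begin_year, end_year, confidence).
candidates = [
    ("Alice", "spouse", "Bob", 1990, 2000, 0.9),
    ("Alice", "spouse", "Carl", 1995, 2005, 0.6),
    ("Alice", "spouse", "Dan", 2006, 2010, 0.7),
]


def overlaps(a, b):
    """Return True if the closed year intervals of two candidates intersect."""
    return a[3] <= b[4] and b[3] <= a[4]


def conflicts(a, b):
    """Assumed temporal constraint: the same subject and relation must not
    point to two different objects during overlapping intervals."""
    return a[0] == b[0] and a[1] == b[1] and a[2] != b[2] and overlaps(a, b)


def select_consistent(cands):
    """Solve a tiny 0/1 program by enumeration.

    Maximize the summed confidence of accepted candidates (x_i = 1)
    subject to x_i + x_j <= 1 for every conflicting pair (i, j).
    """
    n = len(cands)
    conflict_pairs = [
        (i, j)
        for i in range(n)
        for j in range(i + 1, n)
        if conflicts(cands[i], cands[j])
    ]
    best_score, best_assignment = -1.0, (0,) * n
    for x in product((0, 1), repeat=n):
        # Skip assignments that violate any pairwise constraint.
        if any(x[i] + x[j] > 1 for i, j in conflict_pairs):
            continue
        score = sum(c[5] for c, xi in zip(cands, x) if xi)
        if score > best_score:
            best_score, best_assignment = score, x
    return [c for c, xi in zip(cands, best_assignment) if xi]


accepted = select_consistent(candidates)
print([c[2] for c in accepted])
```

In this toy input the Bob and Carl intervals overlap, so only one of them can be kept; the lower-confidence Carl candidate is dropped, while the non-overlapping Dan candidate is unaffected. Enumeration grows exponentially with the number of candidates, so a realistic setting would hand the same objective and constraints to a dedicated ILP solver.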