Extraction of temporal facts and events from Wikipedia

Authors:
Erdal Kuzey;Gerhard Weikum
Affiliations:
Max Planck Institute for Informatics, Saarbrücken, Germany;Max Planck Institute for Informatics, Saarbrücken, Germany
Venue:
Proceedings of the 2nd Temporal Web Analytics Workshop
Year:
2012

Abstract

Recently, large-scale knowledge bases have been constructed by automatically extracting relational facts from text. Unfortunately, most current knowledge bases focus on static facts and ignore the temporal dimension. However, the vast majority of facts evolve over time or are valid only during a particular time period. Thus, time is a significant dimension that should be included in knowledge bases. In this paper, we introduce a complete information extraction framework that harvests temporal facts and events from the semi-structured data and free text of Wikipedia articles to create a temporal ontology. First, we extend a temporal data representation model by making it aware of events. Second, we develop an information extraction method that harvests temporal facts and events from Wikipedia infoboxes, categories, lists, and article titles in order to build a temporal knowledge base. Third, we show how the system can use its extracted knowledge to further grow the knowledge base. We demonstrate the effectiveness of the proposed methods through several experiments. We extracted more than one million temporal facts, with precision over 90% for extraction from semi-structured data and almost 70% for extraction from text.
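
To make the idea of harvesting temporal facts from Wikipedia categories more concrete, the following is a minimal sketch and not the authors' system. It assumes that year-prefixed category names such as "1912 births" can be matched with a regular expression and turned into facts with a validity interval. The `TemporalFact` class, the relation names `bornInYear` and `diedInYear`, and the function `facts_from_categories` are all hypothetical names introduced only for this illustration.

```python
import re
from dataclasses import dataclass

@dataclass(frozen=True)
class TemporalFact:
    """A fact about a subject together with the years in which it holds."""
    subject: str
    relation: str
    begin: int  # first year of validity
    end: int    # last year of validity (equal to begin for a point event)

# Hypothetical mapping from category suffixes to relation names.
CATEGORY_RELATIONS = {"births": "bornInYear", "deaths": "diedInYear"}

# Matches category names of the form "<four-digit year> births" or "<year> deaths".
CATEGORY_PATTERN = re.compile(r"^(\d{4}) (births|deaths)$")

def facts_from_categories(title, categories):
    """Return one TemporalFact per category that matches the year pattern."""
    facts = []
    for category in categories:
        match = CATEGORY_PATTERN.match(category)
        if match is None:
            continue  # category carries no year information in this sketch
        year = int(match.group(1))
        relation = CATEGORY_RELATIONS[match.group(2)]
        facts.append(TemporalFact(title, relation, year, year))
    return facts

if __name__ == "__main__":
    example = facts_from_categories(
        "Alan Turing",
        ["1912 births", "1954 deaths", "English computer scientists"],
    )
    for fact in example:
        print(fact)
```

Each fact carries a begin and end year so that point events (a birth) and longer-lived facts could share one representation. A real extractor would need many more patterns and a way to handle infoboxes, lists, and free text, none of which this sketch attempts.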