Extracting events from wikipedia as RDF triples linked to widespread semantic web datasets

Authors:
Carlo Aliprandi;Francesco Ronzano;Andrea Marchetti;Maurizio Tesconi;Salvatore Minutoli
Affiliations:
Synthema Srl, Pisa, Italy;Institute of Informatics and Telematics (IIT) CNR, Pisa, Italy;Institute of Informatics and Telematics (IIT) CNR, Pisa, Italy;Institute of Informatics and Telematics (IIT) CNR, Pisa, Italy;Institute of Informatics and Telematics (IIT) CNR, Pisa, Italy
Venue:
OCSC'11 Proceedings of the 4th international conference on Online communities and social computing
Year:
2011

Citing 3
Cited 0

Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars

Proceedings of the International Symposium on Natural Language and Logic
Corroborate and learn facts from the web

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Fact Discovery in Wikipedia

WI '07 Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many attempts have been made to extract structured data from Web resources, exposing them as RDF triples and interlinking them with other RDF datasets: in this way it is possible to create clouds of highly integrated Semantic Web data collections. In this paper we describe an approach to enhance the extraction of semantic contents from unstructured textual documents, in particular considering Wikipedia articles and focusing on event mining. Starting from the deep parsing of a set of English Wikipedia articles, we produce a semantic annotation compliant with the Knowledge Annotation Format (KAF). We extract events from the KAF semantic annotation and then we structure each event as a set of RDF triples linked to both DBpedia and WordNet. We point out examples of automatically mined events, providing some general evaluation of how our approach may discover new events and link them to existing contents.