Context-driven semantic enrichment of Italian news archive

Authors:
Andrei Tamilin;Bernardo Magnini;Luciano Serafini;Christian Girardi;Mathew Joseph;Roberto Zanoli
Affiliations:
FBK, Center for Information Technology - IRST, Povo di Trento, Italy;FBK, Center for Information Technology - IRST, Povo di Trento, Italy;FBK, Center for Information Technology - IRST, Povo di Trento, Italy;FBK, Center for Information Technology - IRST, Povo di Trento, Italy;FBK, Center for Information Technology - IRST, Povo di Trento, Italy;FBK, Center for Information Technology - IRST, Povo di Trento, Italy
Venue:
ESWC'10 Proceedings of the 7th International Conference on The Semantic Web: Research and Applications - Volume Part I
Year:
2010

Abstract

Semantic enrichment of textual data is the operation of linking mentions to the entities they refer to, and subsequently enriching those entities with the background knowledge about them available in one or more knowledge bases (or on the entire web). Information about the context in which a mention occurs (e.g., the time, topic, and place that the text relates to) is a critical resource for correct semantic enrichment, for two reasons. First, without context, mentions are “too little text” to refer unambiguously to a single entity. Second, knowledge about entities is itself context dependent (e.g., speaking about the political life of Illinois during 1996, Obama is a Senator, while since 2009 Obama is the US president). In this paper, we describe a concrete approach to context-driven semantic enrichment, built upon four core sub-tasks: detection of mentions in text (i.e., finding references to people, locations, and organizations); determination of the context of discourse of the text; identification of the referred entities in the knowledge base; and enrichment of each entity with the knowledge relevant to the context. Such an approach also requires the background knowledge itself to be contextualized. To cope with this aspect, we propose a customization of Sesame, one of the state-of-the-art knowledge repositories, to support representation of and reasoning with contextualized knowledge. The approach has been fully implemented in a system that has been deployed and applied to the textual archive of the local Italian newspaper “L'Adige”, covering the decade from 1999 to 2009.
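The four sub-tasks named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' system and it does not use Sesame; it is a toy in-memory stand-in. All names (`Context`, `Fact`, `MENTION_INDEX`, `FACTS`, `enrich`, and the helper functions) are hypothetical, and the validity intervals attached to the facts are illustrative assumptions based loosely on the Obama example above.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Context:
    """Context of discourse: time, topic, and place (hypothetical model)."""
    year: int
    topic: str
    location: str


@dataclass(frozen=True)
class Fact:
    """A piece of knowledge about an entity, valid only in some years."""
    entity: str
    role: str
    start_year: int
    end_year: Optional[int] = None  # None means "still valid"


# Toy knowledge base. The entity id and the year ranges are assumptions
# made for illustration only; they are not taken from the paper's data.
MENTION_INDEX: Dict[str, List[str]] = {"Obama": ["barack_obama"]}
FACTS: List[Fact] = [
    Fact("barack_obama", "Senator", 1996, 2008),
    Fact("barack_obama", "US president", 2009),
]


def detect_mentions(text: str) -> List[str]:
    """Sub-task 1: find known surface forms in the text (naive lookup)."""
    return [mention for mention in MENTION_INDEX if mention in text]


def determine_context(metadata: Dict[str, object]) -> Context:
    """Sub-task 2: here the context is simply read from article metadata."""
    return Context(
        year=int(metadata["year"]),
        topic=str(metadata["topic"]),
        location=str(metadata["location"]),
    )


def facts_in_context(entity: str, context: Context) -> List[Fact]:
    """Return the facts about `entity` that hold in the context's year."""
    return [
        fact
        for fact in FACTS
        if fact.entity == entity
        and fact.start_year <= context.year
        and (fact.end_year is None or context.year <= fact.end_year)
    ]


def identify_entity(mention: str, context: Context) -> Optional[str]:
    """Sub-task 3: prefer a candidate that has knowledge valid in context."""
    candidates = MENTION_INDEX.get(mention, [])
    for candidate in candidates:
        if facts_in_context(candidate, context):
            return candidate
    return candidates[0] if candidates else None


def enrich(text: str, metadata: Dict[str, object]) -> Dict[str, List[str]]:
    """Sub-task 4: attach only the context-relevant knowledge to mentions."""
    context = determine_context(metadata)
    enriched: Dict[str, List[str]] = {}
    for mention in detect_mentions(text):
        entity = identify_entity(mention, context)
        if entity is not None:
            enriched[mention] = [f.role for f in facts_in_context(entity, context)]
    return enriched
```

Calling `enrich` on the same sentence with metadata for 1996 and for 2010 would return different roles for the mention "Obama", which is the context dependence the abstract describes. A real system would replace the dictionary lookups with a named-entity recognizer and a contextualized knowledge repository.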