Ontology-driven discourse analysis for information extraction

  • Authors:
  • Philipp Cimiano; Uwe Reyle; Jasmin Šarić

  • Affiliations:
  • Institute AIFB, University of Karlsruhe, Karlsruhe, Germany; Institut für Maschinelle Sprachverarbeitung, University of Stuttgart, Stuttgart, Germany; EML Research gGmbH, Heidelberg, Germany

  • Venue:
  • Data & Knowledge Engineering - Special issue: Natural language and database and information systems: NLDB 03
  • Year:
  • 2005

Abstract

This paper presents a novel approach to discourse analysis within information extraction systems. It uses Discourse Representation Theory (DRT) as a formal representation of the linguistic context and a domain-specific ontology as a basis for computing conceptual relations between extracted events, thereby establishing discourse coherence. The approach has been implemented within GenIE, an information extraction system that aims to extract information about biochemical pathways and about the sequences, structures and functions of genomes and proteins. The approach is evaluated against a semantically hand-annotated set of Swiss-Prot protein function descriptions and shows very promising results.
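
The idea of using an ontology to relate extracted events can be illustrated with a minimal sketch. It is not the GenIE implementation: the concept names, the `IS_A` hierarchy, the `RELATIONS` schema and the `conceptual_relations` function are all hypothetical, and the DRT representation is reduced to an ordered list of event referents. The sketch only shows how a subsumption check against an ontology could license a relation between an earlier and a later event in a discourse.

```python
from dataclasses import dataclass

# Hypothetical toy ontology: concept -> parent concept (is-a hierarchy).
IS_A = {
    "Phosphorylation": "Modification",
    "Modification": "Event",
    "Activation": "Event",
    "Event": None,
}

# Hypothetical relation schema: (domain concept, range concept) -> relation name.
RELATIONS = {
    ("Modification", "Activation"): "causes",
}


@dataclass(frozen=True)
class Event:
    ident: str    # discourse referent, e.g. "e1"
    concept: str  # ontology concept the event instantiates


def subsumes(general, specific):
    """Return True if `specific` equals `general` or is one of its descendants."""
    current = specific
    while current is not None:
        if current == general:
            return True
        current = IS_A.get(current)
    return False


def conceptual_relations(events):
    """Link each event to earlier events whenever the ontology licenses a relation."""
    links = []
    for i, later in enumerate(events):
        for earlier in events[:i]:
            for (domain, rng), name in RELATIONS.items():
                if subsumes(domain, earlier.concept) and subsumes(rng, later.concept):
                    links.append((earlier.ident, name, later.ident))
    return links


# Two events in discourse order, as they might come out of an extraction step.
events = [Event("e1", "Phosphorylation"), Event("e2", "Activation")]
links = conceptual_relations(events)
print(links)
```

In this sketch the relation is found because "Phosphorylation" is a kind of "Modification" in the toy hierarchy, so the schema entry for ("Modification", "Activation") applies to the pair. Reversing the event order yields no link, since the schema is directional.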