Content-based text querying with ontological descriptors

  • Authors:
  • Troels Andreasen;Per Anker Jensen;Jørgen Fischer Nilsson;Patrizia Paggio;Bolette Sandford Pedersen;Hanne Erdman Thomsen

  • Affiliations:
  • Computer Science, Roskilde University, DK-4000 Roskilde, Denmark;Business Communication and Information Science, University of Southern Denmark, DK -6000 Kolding, Denmark;Informatics and Mathematical Modelling, Technical University of Denmark, DK-2800 Lyngby, Denmark;Centre for Language Technology, DK-2300 Copenhagen, Denmark;Centre for Language Technology, DK-2300 Copenhagen, Denmark;Computational Linguistics, Copenhagen Business School, DK-2000 Frederiksberg, Denmark

  • Venue:
  • Data & Knowledge Engineering - NLDB2002
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a method and a system for content-based querying of texts based on the availability of an ontology for the concepts in the text domain. A key principle in the system is the extraction of conceptual content of noun phrases into descriptors forming an integral part of the ontology.The retrieval of text passages rests on matching descriptors from the text against descriptors from the noun phrases in the query. The match does not need to be exact but is mediated by the ontology invoking in particular taxonomic reasoning with sub- and super-concepts. The paper also reports on a prototype implementation of the system.