Semantic cores for representing documents in IR

  • Authors:
  • Mustapha Baziz;Mohand Boughanem;Nathalie Aussenac-Gilles;Claude Chrisment

  • Affiliations:
  • IRIT-Université Paul Sabatier, France;IRIT-Université Paul Sabatier, France;IRIT-Université Paul Sabatier, France;IRIT-Université Paul Sabatier, France

  • Venue:
  • Proceedings of the 2005 ACM symposium on Applied computing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper deals with the use of ontologies for Information Retrieval. Roughly, the proposed approach consists in identifying important concepts in documents using two criterions, co-occurrence and semantic relatedness and then disambiguating them via an external general purpose ontology, namely WordNet. Matching the ontology and a document results in a set of scored concept-senses (nodes) with weighted links. This representation, called semantic core of a document best reveals the semantic content of the document. We regard our approach, of which the first evaluation results are encouraging, as a short but strong step toward the long term goal of Intelligent Indexing and Semantic Retrieval.