A proposal for annotation, semantic similarity and classification of textual documents

  • Authors:
  • Emmanuel Nauer;Amedeo Napoli

  • Affiliations:
  • LORIA — UMR 7503, Vandœuvre-lès-Nancy cedex, France;LORIA — UMR 7503, Vandœuvre-lès-Nancy cedex, France

  • Venue:
  • AIMSA'06 Proceedings of the 12th International Conference on Artificial Intelligence: Methodology, Systems, and Applications
  • Year:
  • 2006

Abstract

In this paper, we present an approach for classifying documents based on the notion of semantic similarity and on an effective representation of document content. The content of a document is annotated, and the resulting annotation is represented by a labeled tree whose nodes and edges correspond to concepts of a domain ontology. A reasoning process can then be carried out on these annotation trees, allowing documents to be compared with one another for classification or information retrieval purposes. The paper concludes with an algorithm for classifying documents with respect to semantic similarity and a discussion.
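
To make the idea of comparing annotation trees through an ontology more concrete, here is a minimal Python sketch. It is not the algorithm of the paper, which the abstract does not detail. It makes several assumptions: the toy ontology `PARENT`, the Wu-Palmer style concept measure, the averaged best-match tree measure, and the nearest-neighbour `classify` function are all hypothetical choices made for illustration. Edge labels are also ignored for brevity, although the abstract states that edges carry concepts too.

```python
# Hypothetical toy ontology: child concept -> parent concept (the root maps to None).
PARENT = {
    "Thing": None,
    "Animal": "Thing",
    "Vehicle": "Thing",
    "Dog": "Animal",
    "Cat": "Animal",
    "Car": "Vehicle",
    "Truck": "Vehicle",
}


def ancestors(concept):
    """Return the path from a concept up to the root, the concept included."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = PARENT[concept]
    return path


def depth(concept):
    """Depth of a concept in the ontology, with the root at depth 1."""
    return len(ancestors(concept))


def concept_similarity(a, b):
    """Wu-Palmer style similarity (an assumed measure, not taken from the paper):
    2 * depth(lcs) / (depth(a) + depth(b)), where lcs is the deepest common ancestor."""
    ancestors_of_b = set(ancestors(b))
    lcs = next(c for c in ancestors(a) if c in ancestors_of_b)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def concepts(tree):
    """Collect the node concepts of an annotation tree given as (concept, [subtrees])."""
    concept, children = tree
    found = [concept]
    for child in children:
        found.extend(concepts(child))
    return found


def tree_similarity(t1, t2):
    """Symmetric average of each node's best match in the other tree.
    Tree structure and edge labels are ignored in this simplification."""
    c1, c2 = concepts(t1), concepts(t2)

    def directed(xs, ys):
        return sum(max(concept_similarity(x, y) for y in ys) for x in xs) / len(xs)

    return (directed(c1, c2) + directed(c2, c1)) / 2


def classify(tree, labelled_trees):
    """Return the label of the most similar tree among (label, tree) pairs."""
    return max(labelled_trees, key=lambda item: tree_similarity(tree, item[1]))[0]


# Usage: classify one annotated document against two labelled reference trees.
references = [
    ("animals", ("Animal", [("Cat", [])])),
    ("vehicles", ("Vehicle", [("Car", [])])),
]
document = ("Dog", [("Cat", [])])
predicted = classify(document, references)
```

In this sketch, two documents are close when their concepts share deep common ancestors in the ontology, so a document about dogs and cats lands nearer to the animal reference tree than to the vehicle one. A measure that also aligns tree structure and edge concepts would follow the representation described in the abstract more closely.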