Ontology Based Semantic Similarity Comparison of Documents

  • Authors:
  • Vladimir Oleshchuk;Asle Pedersen

  • Affiliations:
  • -;-

  • Venue:
  • DEXA '03 Proceedings of the 14th International Workshop on Database and Expert Systems Applications
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this work we consider ontologies as knowledgestructures that specify terms, their properties and relationsamong them to enable knowledge extraction fromtexts. We represent ontologies using a graph-basedmodel that reflect semantic relationship between conceptsand apply them to text analysis and comparison.Instead of raw document comparison we compare documentfootprint enhanced with concepts from the ontology(using different enhancement algorithms). Theresult of this process may be that documents not similarprior to the enhancement become similar (semanticallyon some abstraction level) after the enhancement.This is because the enhancement process may introducein the document footprint abstract concepts from theontology. Using the ontology we can enhance the foot-printsby adding concepts that are not present in theoriginal document. We may use synonyms for a horizontalexpansion and broader terms/superclasses/typesin a vertical expansion or both for that matter.