A content-based approach for document representation and retrieval

  • Authors: Antonio M. Rinaldi

  • Affiliations: University of Napoli Federico II, Napoli, Italy

  • Venue: Proceedings of the eighth ACM symposium on Document engineering
  • Year: 2008

Abstract

In the last few years, the problem of defining efficient techniques for knowledge representation has become a challenging topic in both the academic and industrial communities. The large amount of available data creates several problems in terms of information overload. In this framework, we assume that new approaches for knowledge definition and representation may be useful, in particular those based on the concept of ontology. In this paper we propose a model for knowledge representation based on linguistic concepts and properties. We implement our model in a system which, using novel techniques and metrics, analyzes documents from a semantic point of view, using the Web as the context of interest. Experiments are performed on a test set built using a directory service to obtain information about the analyzed documents. Compared with other similar systems, the obtained results show an effective improvement.