A syntactic approach for searching similarities within sentences

  • Authors:
  • Federica Mandreoli;Riccardo Martoglia;Paolo Tiberio

  • Affiliations:
  • DII - Univ. di Modena e Reggio Emilia, Modena - Italy;DII - Univ. di Modena e Reggio Emilia, Modena - Italy;DII - Univ. di Modena e Reggio Emilia, Modena - Italy

  • Venue:
  • Proceedings of the eleventh international conference on Information and knowledge management
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Textual data is the main electronic form of knowledge representation. Sentences, meant as logic units of meaningful word sequences, can be considered its backbone. In this paper, we propose a solution based on a purely syntactic approach for searching similarities within sentences, named approximate sub2sequence matching. This process being very time consuming, efficiency in retrieving the most similar parts available in large repositories of textual data is ensured by making use of new filtering techniques. As far as the design of the system is concerned, we chose a solution that allows us to deploy approximate sub2 sequence matching without changing the underlying database.