Weighting common syntactic structures for natural language based information retrieval

  • Authors:
  • Chang Liu;Hui Wang;Sally McClean;Epaminondas Kapetanios;Denis Carroll

  • Affiliations:
  • University of Ulster, Jordanstown, Northern Ireland, United Kingdom;University of Ulster, Jordanstown, Northern Ireland, United Kingdom;University of Ulster, Coleraine, Northern Ireland, United Kingdom;University of Westminster, London, United Kingdom;University of Westminster, london, United Kingdom

  • Venue:
  • CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Natural Language Processing (NLP) techniques are believed to hold the potential to assist "bag-of-words" Information Retrieval (IR) in terms of retrieval accuracy. In this paper, we report a natural language based IR approach where the common syntactic structures between documents and the query is regarded to as a query-dependent feature for documents. Specifically, a "structural weight" is proposed for query terms, which can be seen as a weight to model the degree of term's involvement in the common syntactic structures. This structural weight is used together with the TF-IDF weighting scheme, which results in a new ranking function. The accumulation of this structural weight of all the query terms in the new ranking function will be seen as a measure of how much a document and a query share the common syntactic structures. The experimental results show that by using this ranking function, significant improvements in the retrieval performance are achieved.