Lexical issues of a syntactic approach to interactive patent retrieval

  • Authors: Eva D'hondt

  • Affiliations: Centre for Language and Speech Technology, Radboud University Nijmegen, The Netherlands

  • Venue: FDIA'09 Proceedings of the Third BCS-IRSG conference on Future Directions in Information Access

  • Year: 2009

Abstract

Patent retrieval is an information retrieval task with very specific characteristics and demands. In particular, high recall is very important to patent searchers. In the ongoing research project TM4IP, we aim to improve patent retrieval by developing an open-domain patent retrieval system based on linguistic knowledge. By using Dependency Triplets as index terms, our system aims to improve precision and recall compared to keyword-based approaches. One of the cornerstones of a syntactic approach to Information Retrieval is normalisation. This paper describes some of the characteristics of the patent domain that influence lexical normalisation.