Unsupervised learning for reranking-based patent retrieval

  • Authors:
  • wenhui liao;Sriharsha Veeramachaneni

  • Affiliations:
  • Thomson Reuters, Eagan, MN, USA;Thomson Reuters, Eagan, MN, USA

  • Venue:
  • PaIR '10 Proceedings of the 3rd international workshop on Patent information retrieval
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a reranking-based patent retrieval system where the query text is a patent claim, which may be from an existing patent. The novelty of our approach is the automatic generating of training data for learning the ranker. The ranking is based on several features of the candidate patent, such as the text similarity to the claim, international patent code overlap, and internal citation structure of the candidates. Our approach more than doubles the average number of relevant patents in the top 5 over a strong baseline retrieval system.