Patent search using IPC classification vectors

Authors:
Manisha Verma;Vasudeva Varma
Affiliations:
International Institute of Information Technology, Hyderabad, India;International Institute of Information Technology, Hyderabad, India
Venue:
Proceedings of the 4th workshop on Patent information retrieval
Year:
2011

Citing 8
Cited 0

Proposal of two-stage patent retrieval method considering the claim structure

ACM Transactions on Asian Language Information Processing (TALIP)
Cluster-based patent retrieval

Information Processing and Management: an International Journal
Enhancing patent retrieval by citation analysis

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
On the role of classification in patent invalidity searches

Proceedings of the 2nd international workshop on Patent information retrieval
Comparison of IPC and USPC classification systems in patent prior art searches

PaIR '10 Proceedings of the 3rd international workshop on Patent information retrieval
Simple pre and post processing strategies for patent searching in CLEF intellectual property track 2009

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
An IPC-based vector space model for patent retrieval

Information Processing and Management: an International Journal
Cluster-based patent retrieval using international patent classification system

ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead

Quantified Score

Hi-index	0.00

Visualization

Abstract

Finding similar patents is a challenging task in patent information retrieval. A patent application is often a starting point to find similar inventions. Keyword search for similar patents requires significant domain expertise and may not fetch relevant results. We propose a novel representation for patents and use a two stage approach to find similar patents. Each patent is represented as an IPC class vector. Citation network of patents is used to propagate these vectors from a node (patent) to its neighbors (cited patents). Thus, each patent is represented as a weighted combination of its IPC information as well as of its neighbors. A query patent is represented as a vector using its IPC information and similar patents can be simply found by comparing this vector with vectors of patents in the corpus. Text based search is used to re-rank this solution set to improve precision. We experiment with two similarity measures and re-ranking strategies to empirically show that our representation is effective in improving both precision and recall of queries of CLEF-2011 dataset.