The rapid increase in global competition demands stronger protection of intellectual property rights and underlines the importance of patents as major intellectual property documents. Prior art patent search is the task of identifying patents related to a given patent file, and it is an essential step in judging the validity of a patent application. This article proposes an automated query generation and postprocessing method for prior art patent search. The proposed approach first constructs structured queries by combining terms extracted from different fields of a query patent. It then reranks the retrieved patents by combining the retrieval score with the similarity between the International Patent Classification (IPC) codes of the query patent and those of the retrieved patents. An extensive set of experiments on a large-scale, real-world dataset shows that selecting 20 or 30 query terms from all fields of the original query patent according to their log(tf)idf values yields a representative search query, and that this is more effective than using any number of query terms from any single field. Giving higher importance to terms extracted from the abstract, claims, and description fields than to terms extracted from the title field is shown to be more effective than treating all extracted terms equally when forming the search query. Finally, using the similarities between the IPC codes of the query patent and the retrieved patents is shown to improve the effectiveness of the prior art search. © 2012 Wiley Periodicals, Inc.
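The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so the term score `(1 + log tf) * log(N / (1 + df))`, the field weights in `FIELD_WEIGHTS`, the Jaccard overlap on IPC codes truncated to four characters, the mixing parameter `alpha`, and all function names are assumptions chosen for illustration. Retrieval scores are assumed to be normalized to the range 0 to 1.

```python
import math
from collections import Counter

# Hypothetical weights: title terms count less than terms from the other fields.
FIELD_WEIGHTS = {"title": 0.5, "abstract": 1.0, "claims": 1.0, "description": 1.0}


def build_query(patent_fields, doc_freq, n_docs, k=20):
    """Return the k highest-scoring (term, weight) pairs for a query patent.

    patent_fields maps a field name to its list of tokens.
    doc_freq maps a term to the number of collection documents containing it.
    """
    scores = Counter()
    for field, tokens in patent_fields.items():
        weight = FIELD_WEIGHTS.get(field, 1.0)
        for term, tf in Counter(tokens).items():
            # Assumed log(tf)idf variant; the source does not give the exact form.
            idf = math.log(n_docs / (1 + doc_freq.get(term, 0)))
            scores[term] += weight * (1 + math.log(tf)) * idf
    # Sort by descending score, breaking ties alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


def ipc_similarity(query_codes, candidate_codes, prefix_len=4):
    """Jaccard overlap of IPC codes truncated to prefix_len characters."""
    q = {code[:prefix_len] for code in query_codes}
    d = {code[:prefix_len] for code in candidate_codes}
    if not q or not d:
        return 0.0
    return len(q & d) / len(q | d)


def rerank(results, query_codes, alpha=0.3):
    """Rerank (patent_id, retrieval_score, ipc_codes) triples.

    The final score linearly mixes the retrieval score with IPC similarity.
    """
    rescored = [
        (pid, (1 - alpha) * score + alpha * ipc_similarity(query_codes, codes))
        for pid, score, codes in results
    ]
    return sorted(rescored, key=lambda r: -r[1])
```

With `k=20` or `k=30`, `build_query` mirrors the query sizes the abstract reports as effective. The weighted pairs it returns would still need to be translated into the structured query syntax of whichever search engine is used.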