Exploring structured documents and query formulation techniques for patent retrieval

Authors:
Walid Magdy;Johannes Leveling;Gareth J. F. Jones
Affiliations:
School of Computing, Dublin City University, Dublin 9, Ireland;School of Computing, Dublin City University, Dublin 9, Ireland;School of Computing, Dublin City University, Dublin 9, Ireland
Venue:
CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Year:
2009

Citing 2
Cited 7

Using statistical testing in the evaluation of retrieval experiments

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
CLEF-IP 2009: retrieval experiments in the intellectual property domain

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments

Patent query reduction using pseudo relevance feedback

Proceedings of the 20th ACM international conference on Information and knowledge management
United we fall, divided we stand: a study of query segmentation and prf for patent prior art search

Proceedings of the 4th workshop on Patent information retrieval
A study on query expansion methods for patent retrieval

Proceedings of the 4th workshop on Patent information retrieval
Utilizing sub-topical structure of documents for information retrieval

Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
Learning-Based pseudo-relevance feedback for patent retrieval

IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Analyzing query logs of USPTO examiners to identify useful query terms in patent documents for query expansion in patent searching: a preliminary study

IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
CV-PCR: a context-guided value-driven framework for patent citation recommendation

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents the experiments and results of DCU in CLEF-IP 2009. Our work applied standard information retrieval (IR) techniques to patent search. Different experiments tested various methods for the patent retrieval, including query formulation, structured index, weighted fields, document filtering, and blind relevance feedback. Some methods did not show expected good retrieval effectiveness such as blind relevance feedback, other experiments showed acceptable performance. Query formulation was the key to achieving better retrieval effectiveness, and this was performed through assigning higher weights to certain document fields. Further experiments showed that for longer queries, better results are achieved but at the expense of additional computations. For the best runs, the retrieval effectiveness is still lower than for IR applications for other domains, illustrating the difficulty of patent search. The official results have shown that among fifteen participants we achieved the seventh and the fourth ranks from the mean average precision (MAP) and recall point of view, respectively.