Translating collocations for bilingual lexicons: a statistical approach
Computational Linguistics
Combining Multiple Strategies for Effective Monolingual and Cross-Language Retrieval
Information Retrieval
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Going beyond CLEF-IP: the 'reality' for patent searchers?
CLEF'12 Proceedings of the Third international conference on Information Access Evaluation: multilinguality, multimodality, and visual analytics
Leveraging conceptual lexicon: query disambiguation using proximity information for patent retrieval
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Hi-index | 0.00 |
This paper presents PATATRAS (PATent and Article Tracking, Retrieval and AnalysiS), a system realized at the Humboldt University for the IP track of CLEF 2009. Our approach presents three main characteristics: 1. The usage of multiple retrieval models and term index definitions for the three languages considered in the present track producing ten different sets of ranked results. 2. The merging of the different results based on multiple regression models using an additional training set created from the patent collection. 3. The exploitation of patent metadata and the citation structures for creating restricted initial working sets of patents and for producing a final re-ranking regression model. The resulting architecture allowed us to exploit efficiently specific information of patent documents while remaining generic and easy to extend.