Patent retrieval is a branch of Information Retrieval (IR) that aims to support patent professionals in retrieving patents that satisfy their information needs. Patent granting bodies often require patents to be partially translated into one or more major foreign languages, so that language boundaries do not hinder their accessibility. This multilinguality of patent collections offers opportunities for improving patent retrieval. In this work we exploit these opportunities by applying query translation to patent retrieval. We expand monolingual patent queries with their translations, using both a domain-specific patent dictionary that we extract from the patent collection and a general, domain-independent dictionary. Experimental evaluation on a standard CLEF-IP dataset shows that the two translation dictionaries yield similar results: query translation can help patent retrieval, but not always, and without great improvement over standard statistical monolingual query expansion (Rocchio). The improvement is greater when the source language is English than when it is French or German, a finding due partly to the effect of the complex French and German morphology on translation accuracy and partly to the prevalence of English in the collection. A thorough per-query analysis reveals that cases where standard query expansion fails (e.g., zero recall) can benefit from query translation.
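To make the idea of expanding a monolingual query with its translations concrete, the following is a minimal sketch only, not the method evaluated in this work. It assumes a toy English-to-German dictionary with made-up translation probabilities, and the names `TOY_DICTIONARY`, `expand_query`, `top_k`, and `translation_weight` are hypothetical choices for illustration. Each original term keeps a weight of 1.0, and its most probable translations are added with a down-weighted score.

```python
from collections import Counter

# Hypothetical toy dictionary (assumption, not from the paper):
# source term -> list of (translation, translation probability).
TOY_DICTIONARY = {
    "valve": [("ventil", 0.8), ("klappe", 0.2)],
    "pump": [("pumpe", 1.0)],
}


def expand_query(query_terms, dictionary, top_k=1, translation_weight=0.5):
    """Return a weighted bag of terms: the originals plus their top_k translations.

    Original terms get weight 1.0 per occurrence. Each selected translation
    gets translation_weight * probability, so translations never outweigh
    the terms they were derived from.
    """
    weights = Counter()
    for term in query_terms:
        weights[term] += 1.0
        # Rank candidate translations by probability, highest first.
        candidates = sorted(
            dictionary.get(term, []), key=lambda pair: pair[1], reverse=True
        )
        for translation, probability in candidates[:top_k]:
            weights[translation] += translation_weight * probability
    return dict(weights)


# "housing" has no dictionary entry, so it is kept but not expanded.
expanded = expand_query(["valve", "pump", "housing"], TOY_DICTIONARY)
```

The resulting weighted term set would then be passed to a retrieval engine that searches the multilingual fields of the collection. Terms missing from the dictionary are left untranslated, which is one simple way to picture why dictionary coverage and morphology (for example, compounds or inflected forms that do not match dictionary entries) can limit the benefit of translation.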