Experiments in multilingual information retrieval using the SPIDER system
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
QUILT: implementing a large-scale cross-language text retrieval system
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Translingual information retrieval: learning from bilingual corpora
Artificial Intelligence - Special issue: artificial intelligence 40 years later
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Cross-Language Information Retrieval
Cross-Language Information Retrieval
Multilingual Information Access
ESSIR '00 Proceedings of the Third European Summer-School on Lectures on Information Retrieval-Revised Lectures
Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text
AMTA '98 Proceedings of the Third Conference of the Association for Machine Translation in the Americas on Machine Translation and the Information Soup
Mining the Web for bilingual text
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Detection and translation of OOV terms prior to query time
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
ACM Transactions on Asian Language Information Processing (TALIP)
Stemming to improve translation lexicon creation form bitexts
Information Processing and Management: an International Journal
A fast method for parallel document identification
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Adaptive web mining of bilingual lexicons for cross language information retrieval
Proceedings of the 18th ACM conference on Information and knowledge management
Automatic dictionary creation by sub-symbolic encoding of words
WIRN'05 Proceedings of the 16th Italian conference on Neural Nets
Mining OOV translations from mixed-language web pages for cross language information retrieval
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Automatic acquisition of chinese–english parallel corpus from the web
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Translation techniques in cross-language information retrieval
ACM Computing Surveys (CSUR)
Hi-index | 0.00 |
In this paper we describe a system for automatically constructing a bilingual dictionary for cross-language information retrieval applications. We describe how we automatically target candidate parallel documents, filter the candidate documents and process them to create parallel sentences. The parallel sentences are then automatically translated using an adaptation of the EMIM technique and a dictionary of translation terms is created. We evaluate our dictionary using human experts. The evaluation showed that the system performs well. In addition the results obtained from automatically-created corpora are comparable to those obtained from manually created corpora of parallel documents. Compared to other available techniques, our approach has the advantage of being simple, uniform, and easy-to-implement while providing encouraging results.