Resolving ambiguity for cross-language retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Managing gigabytes (2nd ed.): compressing and indexing documents and images
Managing gigabytes (2nd ed.): compressing and indexing documents and images
IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages
IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages
Statistical cross-language information retrieval using n-best query translations
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
TNO at CLEF-2001: Comparing Translation Resources
CLEF '01 Revised Papers from the Second Workshop of the Cross-Language Evaluation Forum on Evaluation of Cross-Language Information Retrieval Systems
Dictionary Methods for Cross-Lingual Information Retrieval
DEXA '96 Proceedings of the 7th International Conference on Database and Expert Systems Applications
Statistical transliteration for english-arabic cross language information retrieval
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Detection and translation of OOV terms prior to query time
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Bootstrapping dictionaries for cross-language information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
ACM Transactions on Asian Language Information Processing (TALIP)
Multilingual modeling of cross-lingual spelling variants
Information Retrieval
FITE-TRT: a high quality translation technique for OOV words
Proceedings of the 2006 ACM symposium on Applied computing
ACM Transactions on Asian Language Information Processing (TALIP)
Named entity transliteration with comparable corpora
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Concept unification of terms in different languages for IR
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Mining key phrase translations from web corpora
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Statistical query translation models for cross-language information retrieval
ACM Transactions on Asian Language Information Processing (TALIP)
Translation disambiguation in web-based translation extraction for English-Chinese CLIR
Proceedings of the 2007 ACM symposium on Applied computing
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Chinese-English term translation mining based on semantic prediction
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Cross-lingual query suggestion using query logs of different languages
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
ACM Transactions on Information Systems (TOIS)
A Hybrid Technique for English-Chinese Cross Language Information Retrieval
ACM Transactions on Asian Language Information Processing (TALIP)
Advanced Information Retrieval
Electronic Notes in Theoretical Computer Science (ENTCS)
Concept unification of terms in different languages via web mining for Information Retrieval
Information Processing and Management: an International Journal
Mining bilingual data from the web with adaptively learnt patterns
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Chinese-English organization name translation based on correlative expansion
NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
Mining Synonymous Transliterations from the World Wide Web
ACM Transactions on Asian Language Information Processing (TALIP)
Exploiting query logs for cross-lingual query suggestions
ACM Transactions on Information Systems (TOIS)
PAKDD'09 Proceedings of the 13th Pacific-Asia international conference on Knowledge discovery and data mining: new frontiers in applied data mining
Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
SDDB: a self-dependent and data-based method for constructing bilingual dictionary from the web
APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
Translation of unknown terms via web mining for information retrieval
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Mining OOV translations from mixed-language web pages for cross language information retrieval
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Translation techniques in cross-language information retrieval
ACM Computing Surveys (CSUR)
Bilingual lexicon extraction from comparable corpora using label propagation
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval
Journal of the American Society for Information Science and Technology
Finding synonyms and other semantically-similar terms from coselection data
AWC '13 Proceedings of the First Australasian Web Conference - Volume 144
Hi-index | 0.00 |
There have been significant advances in Cross-Language Information Retrieval (CLIR) in recent years. One of the major remaining reasons that CLIR does not perform as well as monolingual retrieval is the presence of out of vocabulary (OOV) terms. Previous work has either relied on manual intervention or has only been partially successful in solving this problem. We use a method that extends earlier work in this area by augmenting this with statistical analysis, and corpus-based translation disambiguation to dynamically discover translations of OOV terms. The method can be applied to both Chinese-English and English-Chinese CLIR, correctly extracting translations of OOV terms from the Web automatically, and thus is a significant improvement on earlier work.