An algorithm for string matching with a sequence of don't cares
Information Processing Letters
Identifying word correspondence in parallel texts
HLT '91 Proceedings of the workshop on Speech and Natural Language
New indices for text: PAT Trees and PAT arrays
Information retrieval
Translating collocations for bilingual lexicons: a statistical approach
Computational Linguistics
Query expansion using local and global document analysis
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
PAT-tree-based keyword extraction for Chinese information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Phrasal translation and query expansion techniques for cross-language information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
PATRICIA—Practical Algorithm To Retrieve Information Coded in Alphanumeric
Journal of the ACM (JACM)
IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages
Harvesting translingual vocabulary mappings for multilingual digital libraries
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Cross-lingual relevance models
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Translation of web queries using anchor text mining
ACM Transactions on Asian Language Information Processing (TALIP)
Subject categorization of query terms for exploring Web users' search interests
Journal of the American Society for Information Science and Technology
Anchor Text Mining for Translation of Web Queries
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws
Journal of the American Society for Information Science and Technology
Automatic construction of English/Chinese parallel corpora
Journal of the American Society for Information Science and Technology
A program for aligning sentences in bilingual corpora
Computational Linguistics - Special issue on using large corpora: I
An IR approach for translating new words from nonparallel, comparable texts
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Automatic identification of word translations from unrelated English and German corpora
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Mining the Web for bilingual text
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Introduction to the special topic section on multilingual information systems
Journal of the American Society for Information Science and Technology
Automatic extraction of translations from web-based bilingual materials
Machine Translation
A delimiter-based general approach for Chinese term extraction
Journal of the American Society for Information Science and Technology
PAKDD'09 Proceedings of the 13th Pacific-Asia international conference on Knowledge discovery and data mining: new frontiers in applied data mining
Improving machine transliteration performance by using multiple transliteration models
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Extracting english-korean transliteration pairs from web corpora
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Data Mining and Knowledge Discovery
Hi-index | 0.00 |
Users' cross-lingual queries to a digital library system might be short and the query terms may not be included in a common translation dictionary (unknown terms). In this article, the authors investigate the feasibility of exploiting the Web as the multilingual corpus source to translate unknown query terms for cross-language information retrieval in digital libraries. They propose a Web-based term translation approach to determine effective translations for unknown query terms by mining bilingual search-result pages obtained from a real Web search engine. This approach can enhance the construction of a domain-specific bilingual lexicon and bring multilingual support to a digital library that only has monolingual document collections. Very promising results have been obtained in generating effective translation equivalents for many unknown terms, including proper nouns, technical terms, and Web query terms, and in assisting bilingual lexicon construction for a real digital library system. © 2006 Wiley Periodicals, Inc.