This paper describes our Japanese-Chinese information retrieval system, which takes the "query-translation" approach. The system translates query terms using both a conventional bilingual Japanese-Chinese dictionary and Wikipedia, and we propose that Wikipedia can serve as a good bilingual dictionary for named entities (NEs). Exploiting the nature of the Japanese writing system, we propose that query terms be processed differently depending on the form in which they are written. For weight tuning and term disambiguation, we use an iterative method based on the PageRank algorithm. Evaluated on the NTCIR-5 test set, our system achieves relax MAP (mean average precision) scores as high as 0.2217 for T-runs and 0.2276 for D-runs.
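The abstract does not spell out the iterative method, so the following is only a minimal sketch of how a PageRank-style weight update over translation candidates could look, not the system's actual algorithm. The function name `disambiguate`, the `cooc` association table, the damping value, and the toy data are all hypothetical. Each candidate translation is a node, candidates of different query terms are linked by an association score, and weights are propagated until the best-supported candidate for each term stands out.

```python
def disambiguate(candidates, cooc, damping=0.85, iterations=50):
    """PageRank-style sketch (assumed formulation, not the paper's).

    candidates: one list of translation candidates per query term.
    cooc: dict mapping frozenset({a, b}) to a non-negative association score
          (hypothetical; e.g. derived from corpus co-occurrence counts).
    Returns (best candidate per term, weight per (term_index, candidate)).
    """
    nodes = [(i, c) for i, cands in enumerate(candidates) for c in cands]
    n = len(nodes)

    def link(u, v):
        # Candidates of the same source term compete; they do not support each other.
        if u[0] == v[0]:
            return 0.0
        return cooc.get(frozenset((u[1], v[1])), 0.0)

    # Total outgoing link strength of each node, used to normalize its votes.
    out_sum = {u: sum(link(u, v) for v in nodes) for u in nodes}
    weight = {u: 1.0 / n for u in nodes}

    for _ in range(iterations):
        new_weight = {}
        for v in nodes:
            incoming = sum(
                weight[u] * link(u, v) / out_sum[u]
                for u in nodes
                if out_sum[u] > 0
            )
            new_weight[v] = (1.0 - damping) / n + damping * incoming
        weight = new_weight

    # For each query term, keep the candidate with the highest weight.
    best = [
        max(cands, key=lambda c, i=i: weight[(i, c)]) if cands else None
        for i, cands in enumerate(candidates)
    ]
    return best, weight


# Toy example with made-up candidates and scores.
candidates = [["bank_financial", "bank_river"], ["money"]]
cooc = {
    frozenset(("bank_financial", "money")): 5.0,
    frozenset(("bank_river", "money")): 1.0,
}
best, weight = disambiguate(candidates, cooc)
print(best)
```

In this toy input, "money" is more strongly associated with "bank_financial" than with "bank_river", so the former accumulates more weight. A real system would need to decide how association scores are estimated and when to stop iterating; both are left open here.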