Improving query translation in English-Korean cross-language information retrieval

Authors:
Hee-Cheol Seo;Sang-Bum Kim;Hae-Chang Rim;Sung-Hyon Myaeng
Affiliations:
Department of Computer Science and Engineering, Korea University, 5 ka, Anam-dong, Seongbuk-ku, Seoul 136-701, Korea;Department of Computer Science and Engineering, Korea University, 5 ka, Anam-dong, Seongbuk-ku, Seoul 136-701, Korea;Department of Computer Science and Engineering, Korea University, 5 ka, Anam-dong, Seongbuk-ku, Seoul 136-701, Korea;Information and Communications University, 58-4 Hwaam-dong, Yuseong-gu, Daejeon 305-732, Korea
Venue:
Information Processing and Management: an International Journal - Special issue: Cross-language information retrieval
Year:
2005

Citing 8
Cited 2

Similarity-based approaches to natural language processing

Similarity-based approaches to natural language processing
Resolving ambiguity for cross-language retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A comparison of query translation methods for English-Japanese cross-language information retrieval (poster abstract)

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Using Statistical Term Similarity for Sense Disambiguationin Cross-Language Information Retrieval

Information Retrieval
Cross-Language Information Retrieval in a Multilingual Legal Domain

ECDL '97 Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries
Similarity-based methods for word sense disambiguation

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Should we translate the documents or the queries in cross-language information retrieval?

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Using mutual information to resolve query translation ambiguities and query term weighting

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics

Query translation-based cross-language print defect diagnosis based on the fuzzy Bayesian model

Journal of Intelligent Manufacturing
Flat vs. hierarchical phrase-based translation models for cross-language information retrieval

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Query translation is a viable method for cross-language information retrieval (CLIR), but it suffers from translation ambiguities caused by multiple translations of individual query terms. Previous research has employed various methods for disambiguation, including the method of selecting an individual target query term from multiple candidates by comparing their statistical associations with the candidate translations of other query terms. This paper proposes a new method where we examine all combinations of target query term translations corresponding to the source query terms, instead of looking at the candidates for each query term and selecting the best one at a time. The goodness value for a combination of target query terms is computed based on the association value between each pair of the terms in the combination. We tested our method using the NTCIR-3 English Korean-CLIR test collection. The results show some improvements regardless of the association measures we used.