Comparative study of monolingual and multilingual search models for use with asian languages

Authors:
Jacques Savoy
Affiliations:
Université de Neuchâtel, Neuchâtel, Switzerland
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2005

Citing 26
Cited 12

On term selection for query expansion

Journal of Documentation
A comparison of indexing techniques for Japanese text retrieval

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Using n-grams for Korean text retrieval

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Statistical inference in retrieval effectiveness evaluation

Information Processing and Management: an International Journal
Employing multiple representations for Chinese information retrieval

Journal of the American Society for Information Science
Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Experimentation as a way of life: Okapi at TREC

Information Processing and Management: an International Journal - The sixth text REtrieval conference (TREC-6)
Database merging strategy based on logistic regression

Information Processing and Management: an International Journal
An information-theoretic approach to automatic query expansion

ACM Transactions on Information Systems (TOIS)
Modeling score distributions for combining the outputs of search engines

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Predicting query performance

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic models of information retrieval based on measuring the divergence from randomness

ACM Transactions on Information Systems (TOIS)
Fusion Via a Linear Combination of Scores

Information Retrieval
Using Corpus-Based Approaches in a System for Multilingual Information Retrieval

Information Retrieval
Using Statistical Translation Models for Bilingual IR

CLEF '01 Revised Papers from the Second Workshop of the Cross-Language Evaluation Forum on Evaluation of Cross-Language Information Retrieval Systems
Probabilistic approaches to topic detection and tracking

Topic detection and tracking
A comparison of Chinese document indexing strategies and retrieval models

ACM Transactions on Asian Language Information Processing (TALIP)
Cross-Language Evaluation Forum: Objectives, Results, Achievements

Information Retrieval
Combining Multiple Strategies for Effective Monolingual and Cross-Language Retrieval

Information Retrieval
Multilingual Information Retrieval Using Machine Translation, Relevance Feedback and Decompounding

Information Retrieval
How Effective is Stemming and Decompounding for German Text Retrieval?

Information Retrieval
A program for aligning sentences in bilingual corpora

Computational Linguistics - Special issue on using large corpora: I
Chinese word segmentation and its effect on information retrieval

Information Processing and Management: an International Journal
Lexicon-based orthographic disambiguation in CJK intelligent information retrieval

COLING '02 Proceedings of the 3rd workshop on Asian language resources and international standardization - Volume 12
Report on thomson legal and regulatory experiments at CLEF-2004

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
Data fusion for effective european monolingual information retrieval

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images

Adapting pivoted document-length normalization for query size: Experiments in Chinese and English

ACM Transactions on Asian Language Information Processing (TALIP)
A weighted string pattern matching-based passage ranking algorithm for video question answering

Expert Systems with Applications: An International Journal
BVideoQA: Online English-Chinese bilingual video question answering

Journal of the American Society for Information Science and Technology
Investigation in statistical language-independent approaches for opinion detection in English, Chinese and Japanese

CLIAWS3 '09 Proceedings of the Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies
Multilingual knowledge management

Artificial intelligence
Ad hoc retrieval with the Persian language

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Selecting automatically the best query translations

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
A new passage ranking algorithm for video question answering

PSIVT'06 Proceedings of the First Pacific Rim conference on Advances in Image and Video Technology
Translation techniques in cross-language information retrieval

ACM Computing Surveys (CSUR)
Experiments with monolingual, bilingual, and robust retrieval

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Collaborative pseudo-relevance feedback

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Based on the NTCIR-4 test-collection, our first objective is to present an overview of the retrieval effectiveness of nine vector-space and two probabilistic models that perform monolingual searches in the Chinese, Japanese, Korean, and English languages. Our second goal is to analyze the relative merits of the various automated and freely available toolsto translate the English-language topics into Chinese, Japanese, or Korean, and then submit the resultant query in order to retrieve pertinent documents written in one of the three Asian languages. We also demonstrate how bilingual searches could be improved by applying both the combined query translation strategies and data-fusion approaches. Finally, we address basic problems related to multilingual searches, in which queries written in English are used to search documents written in the English, Chinese, Japanese, and Korean languages.