Retrieval parameter optimization using genetic algorithms

Authors:
Sumio Fujita
Affiliations:
Yahoo Japan Corporation, Midtown-tower, 9-7-1 Akasaka, Minato-ku, Tokyo 107-6211, Japan
Venue:
Information Processing and Management: an International Journal
Year:
2009

Citing 22
Cited 2

Probabilistic and genetic algorithms in document retrieval

Communications of the ACM
Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
A probabilistic learning approach for document indexing

ACM Transactions on Information Systems (TOIS) - Special issue on research and development in information retrieval
Adaptation in natural and artificial systems

Adaptation in natural and artificial systems
Probabilistic retrieval based on staged logistic regression

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic information retrieval as a combination of abstraction, inductive learning, and probabilistic assumptions

ACM Transactions on Information Systems (TOIS)
Query modification using genetic algorithms in vector space models

International Journal of Expert Systems
Inferring probability of relevance using the method of logistic regression

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Crossover improvement for the genetic algorithm in information retrieval

Information Processing and Management: an International Journal
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A clustering strategy based on a formalism of the reproductive process in natural systems

SIGIR '79 Proceedings of the 2nd annual international ACM SIGIR conference on Information storage and retrieval: information implications into the eighties
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Distributed Genetic Algorithms

Proceedings of the 3rd International Conference on Genetic Algorithms
Genetic algorithms in relevance feedback: a second test and new contributions

Information Processing and Management: an International Journal
Discriminative models for information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Tuning before feedback: combining ranking discovery and blind feedback for robust retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A generic ranking function discovery framework by genetic programming for information retrieval

Information Processing and Management: an International Journal
Revisiting Document Length Hypotheses: A Comparative Study of Japanese Newspaper and Patent Retrieval

ACM Transactions on Asian Language Information Processing (TALIP)
Retrieval of Web Resources Using a Fusion of Ontology-Based and Content-Based Retrieval with the RS Vector Space Model on a Portal for Japanese Universities and Academic Institutes

HICSS '06 Proceedings of the 39th Annual Hawaii International Conference on System Sciences - Volume 03
Technology survey and invalidity search: A comparative study of different tasks for Japanese patent document retrieval

Information Processing and Management: an International Journal
A combined component approach for finding collection-adapted ranking functions based on genetic programming

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

A hybrid system for dental milling parameters optimisation

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Applying soft computing techniques to optimise a dental milling process

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes our experiments on automatic parameter optimization for the Japanese monolingual retrieval task. Unlike regression approaches, we optimized parameters completely independently of retrieval models enabling the optimized parameter set to illustrate the characteristics of the target test collections. We adopted genetic algorithms as optimization tools and cross-validated with four test collections, namely the CLIR-J-J collections for NTCIR-3 to NTCIR-6. The most difficult retrieval parameters to optimize are the feedback parameters, because there are no principles for calibrating them. Our approach optimized feedback parameters and basic scoring parameters at the same time. Using test sets and validation sets, we achieved effectiveness levels comparable with very strong baselines, i.e., the best-performing NTCIR official runs.