Using genetic algorithms for query reformulation

  • Authors:
  • José R. Pérez-Agüera

  • Affiliations:
  • Dept. Software Engineering and AI, Facultad de Informática, University Complutense of Madrid

  • Venue:
  • FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Nowadays, searching information in the web or in any kind of document collection has become one of the most frequent activities. However, user queries can be formulated in a way that hinders the recovery of the requested information. The objective of automatic query transformation is to improve the quality of the recovered information. This paper describes a new genetic algorithm used to change the set of terms that compose a user query without user supervision, by complementing an expansion process based on the use of a morphological thesaurus. We apply a stemming process to obtain the stem of a word, for which the thesaurus provides its different forms. The set of candidate query terms is constructed by expanding each term in the original query with the terms morphologically related. The genetic algorithm is in charge of selecting the terms of the final query from the candidate term set. The selection process is based on the retrieval results obtained when searching with different combination of candidate terms. The algorithm shows improvement over some other using standard collections.