On the Selection of the Best Retrieval Result Per Query ---An Alternative Approach to Data Fusion---

  • Authors:
  • Antonio Juárez-González;Manuel Montes-Y-Gómez;Luis Villaseñor-Pineda;Daniel Ortíz-Arroyo

  • Affiliations:
  • Laboratory of Language Technologies, Department of Computational Sciences, National Institute of Astrophysics, Optics and Electronics (INAOE), Mexico;Laboratory of Language Technologies, Department of Computational Sciences, National Institute of Astrophysics, Optics and Electronics (INAOE), Mexico;Laboratory of Language Technologies, Department of Computational Sciences, National Institute of Astrophysics, Optics and Electronics (INAOE), Mexico;Department of Computer Science and Engineering, Aalborg University, Esbjerg, Denmark

  • Venue:
  • FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Some recent works have shown that the "perfect" selection of the best IR system per query could lead to a significant improvement on the retrieval performance. Motivated by this fact, in this paper we focus on the automatic selection of the best retrieval result from a given set of results lists generated by different IR systems. In particular, we propose five heuristic measures for evaluating the relative relevance of each result list, which take into account the redundancy and ranking of documents across the lists. Preliminary results in three different data sets, and considering 216 queries, are encouraging. They show that the proposed approach could slightly outperform the results from the best individual IR system in two out of three collections, but that it could significantly improve the average results of individual systems from all data sets. In addition, the achieved results indicate that our approach is a competitive alternative to traditional data fusion methods.