Applying machine learning diversity metrics to data fusion in information retrieval

  • Authors:
  • David Leonard;David Lillis;Lusheng Zhang;Fergus Toolan;Rem W. Collier;John Dunnion

  • Affiliations:
  • School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland

  • Venue:
  • ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Supervised Machine Learning task of classification has parallels with Information Retrieval (IR): in each case, items (documents in the case of IR) are required to be categorised into discrete classes (relevant or non-relevant). Thus a parallel can also be drawn between classifier ensembles, where evidence from multiple classifiers are combined to achieve a superior result, and the IR data fusion task. This paper presents preliminary experimental results on the applicability of classifier ensemble diversity metrics in data fusion. Initial results indicate a relationship between the quality of the fused result set (as measured by MAP) and the diversity of its inputs