ListOPT: learning to optimize for XML ranking

  • Authors:
  • Ning Gao;Zhi-Hong Deng;Hang Yu;Jia-Jian Jiang

  • Affiliations:
  • Key Laboratory of Machine Perception, Ministry of Education), School of Electronic Engineering and Computer Science, Peking University;Key Laboratory of Machine Perception, Ministry of Education), School of Electronic Engineering and Computer Science, Peking University and The State Key Lab of Computer Science, Institute of Softw ...;Key Laboratory of Machine Perception, Ministry of Education), School of Electronic Engineering and Computer Science, Peking University;Key Laboratory of Machine Perception, Ministry of Education), School of Electronic Engineering and Computer Science, Peking University

  • Venue:
  • PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
  • Year:
  • 2011
  • Combining strategies for XML retrieval

    INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many machine learning classification technologies such as boosting, support vector machine or neural networks have been applied to the ranking problem in information retrieval. However, since the purpose of these learning-torank methods is to directly acquire the sorted results based on the features of documents, they are unable to combine and utilize the existing ranking methods proven to be effective such as BM25 and PageRank. To solve this defect, we conducted a study on learning-to-optimize, which is to construct a learning model or method for optimizing the free parameters in ranking functions. This paper proposes a listwise learning-to-optimize process ListOPT and introduces three alternative differentiable query-level loss functions. The experimental results on the XML dataset of Wikipedia English show that these approaches can be successfully applied to tuning the parameters used in an existing highly cited ranking function BM25. Furthermore, we found that the formulas with optimized parameters indeed improve the effectiveness compared with the original ones.