Combining strategies for XML retrieval

  • Authors:
  • Ning Gao;Zhi-Hong Deng;Jia-Jian Jiang;Sheng-Long Lv;Hang Yu

  • Affiliations:
  • Key Laboratory of Machine Perception, Ministry of Education, School of Electronic Engineering and Computer Science, Peking University;Key Laboratory of Machine Perception, Ministry of Education, School of Electronic Engineering and Computer Science, Peking University and The State Key Lab of Computer Science, Institute of Softwa ...;Key Laboratory of Machine Perception, Ministry of Education, School of Electronic Engineering and Computer Science, Peking University;Key Laboratory of Machine Perception, Ministry of Education, School of Electronic Engineering and Computer Science, Peking University;Key Laboratory of Machine Perception, Ministry of Education, School of Electronic Engineering and Computer Science, Peking University

  • Venue:
  • INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes Peking University's approaches to the Ad Hoc, Data Centric and Relevance Feedback track. In Ad Hoc track, results for four tasks were submitted, Efficiency, Restricted Focused, Relevance In Context and Restricted Relevance In Context. To evaluate the relevance between documents and a given query, multiple strategies, such as Two-Step retrieval, MAXLCA query results, BM25, distribution measurements and learn-to-optimize method are combined to form a more effective search engine. In Data Centric track, to gain a set of closely related nodes that are collectively relevant to a given keyword query, we promote three factors, correlation, explicitnesses and distinctiveness. In Relevance Feedback track, to obtain useful information from feedbacks, our implementation employs two techniques, a revised Rocchio algorithm and criterion weight adjustment.