Mining test oracles of web search engines

  • Authors:
  • Wujie Zheng;Hao Ma;Michael R. Lyu;Tao Xie;Irwin King

  • Affiliations:
  • Computer Science and Engineering, The Chinese University of Hong Kong, China;Internet Services Research Center (ISRC), Microsoft Research, Redmond, USA;Computer Science and Engineering, The Chinese University of Hong Kong, China;Department of Computer Science, North Carolina State University, USA;Computer Science and Engineering, The Chinese University of Hong Kong, China

  • Venue:
  • ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web search engines have major impact in people's everyday life. It is of great importance to test the retrieval effectiveness of search engines. However, it is labor-intensive to judge the relevance of search results for a large number of queries, and these relevance judgments may not be reusable since the Web data change all the time. In this work, we propose to mine test oracles of Web search engines from existing search results. The main idea is to mine implicit relationships between queries and search results, e.g., some queries may have fixed top 1 result while some may not, and some Web domains may appear together in top 10 results. We define a set of items of queries and search results, and mine frequent association rules between these items as test oracles. Experiments on major search engines show that our approach mines many high-confidence rules that help understand search engines and detect suspicious search results.