How to define searching sessions on web search engines

  • Authors:
  • Bernard J. Jansen;Amanda Spink;Vinish Kathuria

  • Affiliations:
  • College of Information Sciences and Technology, The Pennsylvania State University, University Park, Pennsylvania;Faculty of Information Technology, Queensland University of Technology, Brisbane, QLD, Australia;Search Engineer, Infospace, Inc. - Search & Directory, Bellevue, WA

  • Venue:
  • WebKDD'06 Proceedings of the 8th Knowledge discovery on the web international conference on Advances in web mining and web usage analysis
  • Year:
  • 2006

Abstract

In this research, we investigate three techniques for defining user sessions on Web search engines. We analyze 2,465,145 interactions from 534,507 Web searchers and compare three methods for defining sessions, using: 1) Internet Protocol (IP) address and cookie; 2) IP address, cookie, and a temporal limit on intra-session interactions; and 3) IP address, cookie, and query reformulation patterns. Results show that defining sessions by query reformulation provides the best session identification, with nearly 95% accuracy. This method also yields an 82% increase in the number of sessions compared to using IP address and cookie alone. Regardless of the method, mean session length was fewer than three queries and mean session duration was less than 30 minutes. The implication is that unique sessions may be a better indicator than the common industry metric of unique visitors for measuring search traffic. Results of this research may lead to tools that better support Web searching.
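
The three session definitions compared in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The `Interaction` record layout, the function names, the 30-minute default cutoff, and the "no shared query terms" test used as a stand-in for query reformulation patterns are all assumptions made for illustration; the abstract does not specify the threshold or the reformulation rules actually used.

```python
from collections import defaultdict
from typing import Callable, NamedTuple


class Interaction(NamedTuple):
    """One logged search interaction (hypothetical record layout)."""
    ip: str
    cookie: str
    timestamp: float  # seconds since some fixed epoch
    query: str


def sessions_by_user(log):
    """Method 1: all interactions sharing an (IP, cookie) pair form one session."""
    groups = defaultdict(list)
    for rec in sorted(log, key=lambda r: r.timestamp):
        groups[(rec.ip, rec.cookie)].append(rec)
    return list(groups.values())


def split_sessions(log, starts_new_session: Callable):
    """Split each (IP, cookie) group wherever the predicate fires
    between two consecutive interactions."""
    sessions = []
    for records in sessions_by_user(log):
        current = [records[0]]
        for prev, cur in zip(records, records[1:]):
            if starts_new_session(prev, cur):
                sessions.append(current)
                current = []
            current.append(cur)
        sessions.append(current)
    return sessions


def sessions_by_time(log, cutoff_seconds=30 * 60):
    """Method 2: IP, cookie, and a temporal limit between interactions.
    The 30-minute default is an assumed value, not taken from the abstract."""
    return split_sessions(
        log, lambda prev, cur: cur.timestamp - prev.timestamp > cutoff_seconds
    )


def sessions_by_reformulation(log):
    """Method 3: IP, cookie, and query reformulation.
    Simplifying assumption: a query sharing no terms with the previous
    query starts a new session."""
    def no_shared_terms(prev, cur):
        prev_terms = set(prev.query.lower().split())
        cur_terms = set(cur.query.lower().split())
        return not (prev_terms & cur_terms)

    return split_sessions(log, no_shared_terms)
```

Each function takes a list of `Interaction` records and returns a list of sessions, where a session is a list of records. Because methods 2 and 3 only ever subdivide the groups produced by method 1, they can never produce fewer sessions than method 1, which is consistent with the increase in session count reported above.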