Analysis of a very large web search engine query log
ACM SIGIR Forum
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
What's new on the web?: the evolution of the web from a search engine perspective
Proceedings of the 13th international conference on World Wide Web
Adaptive web search based on user profile constructed without any effort from users
Proceedings of the 13th international conference on World Wide Web
Evaluating implicit measures to improve web search
ACM Transactions on Information Systems (TOIS)
Automatic identification of user goals in Web search
WWW '05 Proceedings of the 14th international conference on World Wide Web
Personalizing search via automated analysis of interests and activities
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
LA-WEB '05 Proceedings of the Third Latin American Web Congress
An experimental comparison of click position-bias models
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Web retrieval: Techniques for the aggregation and selection of queries and answers
International Journal of Intelligent Systems - Aggregation Operators for Information Systems
Beyond DCG: user behavior as a predictor of a successful search
Proceedings of the third ACM international conference on Web search and data mining
A user study of web search session behaviour using eye tracking data
BCS-HCI '12 Proceedings of the 26th Annual BCS Interaction Specialist Group Conference on People and Computers
Hi-index | 0.00 |
In this paper we present and empirically evaluate a 'continuity-cost model' for Internet query sessions made by users. We study the relation of different 'cost factors' for a user query session, with the continuity of the user in that query session, and the order of the query in the query session. We define cost indicators from the available query log data, which are to be studied in relation to continuity and to the order/number of the query (1st, 2nd, 3rd, ..). One of our hypotheses is that cost related factors will reflect the step by step nature of the query session process. We use descriptive statistics together with rule induction to identify the most relevant factors and observable trends, and produce three classifier data models, one for each 'query number', using the 'continuity flag' as classifier label. Using the cost factors, we identify trends relating continuity/query number to user behavior, and we can use that information, for example, to make decisions about caching and query recommendation.