Markovian analysis for automatic new topic identification in search engine transaction logs

  • Authors: Huseyin C. Ozmutlu
  • Affiliations: Industrial Engineering Department, Uludag University, Gorukle, Bursa, Turkey
  • Venue: Applied Stochastic Models in Business and Industry
  • Year: 2009

Abstract

Topic analysis of search engine user queries is an important task, since successfully exploiting the topic of queries can lead to new information retrieval algorithms and more efficient search engines. Identifying topic changes within a user search session is a key issue in the analysis of search engine user queries. This study applies Markov chains to search engine research in order to identify topic changes in a user session automatically, using statistical characteristics of queries such as time intervals, query reformulation patterns and the continuation-shift status of the previous query. The findings show that Markov chains provide fairly successful results for automatic new topic identification, with a high level of estimation accuracy for both topic continuations and shifts. Copyright © 2009 John Wiley & Sons, Ltd.
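
The abstract does not give the model's details, so the following is only a minimal sketch of how a Markov-chain style classifier over the three named characteristics might look, not the author's method. It assumes that each query after the first is described by a discretized time interval and a reformulation pattern, that training sessions carry hand-assigned continuation/shift labels, and that a session starts in the continuation state. All names (`fit_transitions`, `predict`, the bucket and pattern values) are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical status labels; the abstract does not specify an encoding.
SHIFT, CONT = "shift", "continuation"


def fit_transitions(sessions):
    """Estimate P(status | previous status, time bucket, reformulation pattern).

    Each session is a list of (time_bucket, pattern, status) tuples,
    one per query after the first query of the session.
    """
    counts = defaultdict(Counter)
    for session in sessions:
        prev = CONT  # assumption: every session starts as a continuation
        for time_bucket, pattern, status in session:
            counts[(prev, time_bucket, pattern)][status] += 1
            prev = status
    # Normalize the counts of each conditioning state into probabilities.
    return {
        state: {s: n / sum(c.values()) for s, n in c.items()}
        for state, c in counts.items()
    }


def predict(transitions, session_features, default=CONT):
    """Label each query with the most probable status given the chain state.

    Conditioning states never seen in training fall back to `default`.
    """
    prev, labels = CONT, []
    for time_bucket, pattern in session_features:
        probs = transitions.get((prev, time_bucket, pattern))
        status = max(probs, key=probs.get) if probs else default
        labels.append(status)
        prev = status  # the predicted status conditions the next step
    return labels


# Toy, made-up training data purely for illustration.
training = [
    [("short", "reformulation", CONT), ("long", "new", SHIFT)],
    [("short", "reformulation", CONT), ("short", "reformulation", CONT)],
    [("long", "new", SHIFT), ("short", "reformulation", CONT)],
]
model = fit_transitions(training)
result = predict(model, [("short", "reformulation"), ("long", "new")])
print(result)  # -> ['continuation', 'shift']
```

Feeding the predicted status back in as the previous status is what makes this a chain rather than an independent per-query classifier; how the study itself defines states and estimates transition probabilities is not stated in the abstract.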