Utilizing variability of time and term content, within and across users in session detection

  • Authors:
  • Shuqi Sun;Sheng Li;Muyun Yang;Haoliang Qi;Tiejun Zhao

  • Affiliations:
  • Harbin Institute of Technology;Harbin Institute of Technology;Harbin Institute of Technology;Heilongjiang Institute of Technology;Harbin Institute of Technology

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe a SVM classification framework of session detection task on both Chinese and English query logs. With eight features on the aspects of temporal and content information extracted from pairs of successive queries, the classification models achieve significantly superior performance than the stat-of-the-art method. Additionally, we find through ROC analysis that there exists great discrimination power variability among different features and within the same feature across different users. To fully utilize this variability, we build local models for individual users and combine their predictions with those from the global model. Experiments show that the local models do make significant improvements to the global model, although the amount is small.