Session boundary detection for association rule learning using n-gram language models

  • Authors:
  • Xiangji Huang;Fuchun Peng;Aijun An;Dale Schuurmans;Nick Cercone

  • Affiliations:
  • School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada;School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada;Department of Computer Science, York University, Toronto, Ontario, Canada;School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada;Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada

  • Venue:
  • AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a statistical method using n-gram language models to identify session boundaries in a large collection of Livelink log data. The identified sessions are then used for association rule learning. Unlike the traditional ad hoc timeout method, which uses fixed time thresholds for session identification, our method uses an information theoretic approach that provides a natural technique for performing dynamic session identification. The effectiveness of our approach is evaluated with respect to 4 different interestingness measures. We find that we obtain a significant improvement in each interestingness measure, ranging from a 26.6% to 39% improvement on average over the best results obtained with standard timeout methods.