Similarity Measurement of Web Sessions by Sequence Alignment

  • Authors:
  • Chaofeng Li;Yansheng Lu

  • Affiliations:
  • South-Central University for Nationalities;Huazhong University of Science and Technology

  • Venue:
  • NPC '07 Proceedings of the 2007 IFIP International Conference on Network and Parallel Computing Workshops
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

The task of clustering web sessions is to group web sessions based on similarity and consists of maximizing the intra-group similarity while minimizing the inter-group similarity. The first and foremost question needed to be considered in clustering web sessions is how to measure the similarity between web sessions. However, there are many shortcomings in traditional measurements. This paper analyses the shortcomings of traditional methods and introduces a new method to measure similarities between web pages, which considers not only the URL but also the viewing time of the visited web page. Then we propose a new method to measure the similarity of web sessions using sequence alignment and the similarity of web page access. Finally, we conclude this paper and propose the future research directions.