Discovering user access pattern based on probabilistic latent factor model

  • Authors:
  • Guandong Xu;Yanchun Zhang;Jiangang Ma;Xiaofang Zhou

  • Affiliations:
  • Victoria University, VIC, Australia;Victoria University, VIC, Australia;Victoria University, VIC, Australia;University of Queensland, Brisbane, Australia

  • Venue:
  • ADC '05 Proceedings of the 16th Australasian database conference - Volume 39
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

There has been an increased demand for characterizing user access patterns using web mining techniques since the informative knowledge extracted from web server log files can not only offer benefits for web site structure improvement but also for better understanding of user navigational behavior. In this paper, we present a web usage mining method, which utilize web user usage and page linkage information to capture user access pattern based on Probabilistic Latent Semantic Analysis (PLSA) model. A specific probabilistic model analysis algorithm, EM algorithm, is applied to the integrated usage data to infer the latent semantic factors as well as generate user session clusters for revealing user access patterns. Experiments have been conducted on real world data set to validate the effectiveness of the proposed approach. The results have shown that the presented method is capable of characterizing the latent semantic factors and generating user profile in terms of weighted page vectors, which may reflect the common access interest exhibited by users among same session cluster.