A Markovian approach for web user profiling and clustering

  • Authors:
  • Younes Hafri;Chabane Djeraba;Peter Stanchev;Bruno Bachimont

  • Affiliations:
  • Institut National de l'Audiovisuel, Bry-sur-Marne Cedex, France and Institut de Recherche en Informatique de Nantes, Nantes Cedex, France;Institut de Recherche en Informatique de Nantes, Nantes Cedex, France;Kettering University, Flint, MI;Institut National de l'Audiovisuel, Bry-sur-Marne Cedex, France

  • Venue:
  • PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2003

Abstract

The objective of this paper is to propose an approach that automatically extracts web user profiles from user navigation paths. A web user profile consists of the most representative behaviors, represented by Markov models (MM). To achieve this objective, our approach is built around three notions: (1) applying probabilistic exploration using Markov models; (2) avoiding the high dimensionality and sparsity of Markov models by clustering web documents, based on their content, before applying the Markov analysis; (3) clustering the Markov models and extracting their gravity centers. On the basis of these three notions, the approach makes it possible to predict the states to be visited in k steps and to monitor navigation sessions, based on both content and traversed paths. The original application of the approach concerns the exploitation of multimedia archives in the context of the Copyright Deposit that preserves French WWW documents. The approach may serve as an exploitation tool for any web site.
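
The three notions can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not state: each state is the id of a content-based document cluster, the model is a first-order Markov chain estimated from transition counts, rows with no observed transitions fall back to a uniform distribution, and a "gravity center" is taken to be the element-wise mean of the transition matrices in a cluster. All function names are hypothetical.

```python
def transition_matrix(sessions, n_states):
    """Estimate a first-order Markov transition matrix from navigation sessions.

    Each session is a list of state indices (assumed here to be ids of
    content-based document clusters rather than individual pages).
    """
    counts = [[0.0] * n_states for _ in range(n_states)]
    for session in sessions:
        for src, dst in zip(session, session[1:]):
            counts[src][dst] += 1.0
    matrix = []
    for row in counts:
        total = sum(row)
        if total:
            matrix.append([c / total for c in row])
        else:
            # Assumption: unseen states get a uniform outgoing distribution.
            matrix.append([1.0 / n_states] * n_states)
    return matrix


def predict_k_steps(start, matrix, k):
    """Return the distribution over states k steps after leaving `start`."""
    n = len(matrix)
    dist = [0.0] * n
    dist[start] = 1.0
    for _ in range(k):
        # One step of the chain: multiply the row vector by the matrix.
        dist = [sum(dist[i] * matrix[i][j] for i in range(n)) for j in range(n)]
    return dist


def gravity_center(matrices):
    """Element-wise mean of transition matrices (assumed centroid definition)."""
    n = len(matrices[0])
    m = len(matrices)
    return [[sum(mat[i][j] for mat in matrices) / m for j in range(n)]
            for i in range(n)]


# Toy data: three sessions over three document clusters (0, 1, 2).
sessions = [[0, 1, 2], [0, 1, 1], [0, 2, 2]]
P = transition_matrix(sessions, 3)
two_step = predict_k_steps(0, P, 2)
center = gravity_center([P, P])
```

Because the mean of row-stochastic matrices is itself row-stochastic, the centroid in this sketch can be used directly as the Markov model representing a cluster of users.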