Finding Generalized Path Patterns for Web Log Data Mining

  • Authors:
  • Alexandros Nanopoulos;Yannis Manolopoulos

  • Affiliations:
  • -;-

  • Venue:
  • ADBIS-DASFAA '00 Proceedings of the East-European Conference on Advances in Databases and Information Systems Held Jointly with International Conference on Database Systems for Advanced Applications: Current Issues in Databases and Information Systems
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Conducting data mining on logs of web servers involves the determination of frequently occurring access sequences. We examine the problem of finding traversal patterns from web logs by considering the fact that irrelevant accesses to web documents may be interleaved within access patterns due to navigational purposes. We define a general type of pattern that takes into account this fact and also, we present a level-wise algorithm for the determination of these patterns, which is based on the underlying structure of the web site. The performance of the algorithm and its sensitivity to several parameters is examined experimentally with synthetic data.