Towards Knowledge Discovery from WWW Log Data

  • Authors:
  • Feng Tao;Fionn Murtagh

  • Affiliations:
  • -;-

  • Venue:
  • ITCC '00 Proceedings of the The International Conference on Information Technology: Coding and Computing (ITCC'00)
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

As the result of interactions between visitors and a web site, an http log file contains very rich knowledge about users on-site behaviors, which, if fully exploited, can better customer services and site performance. Different to most of the existing log analysis tools which use statistical counting summaries on pages, hosts, etc., we propose a transaction model to represent users access history and a framework to adapt data mining techniques such as sequence and association rule mining to these transactions. In this framework, all transactions are extracted from the raw log file though a series of systematic data preparation phases. We discuss different methods to identify a user, and separate long convoluted sequences into semantically meaningful sessions and transactions. A new feature called interestingness is defined to model user interests in different web sections. With all the transactions being imported into an adapted cube structure with a concept hierarchy attached to each dimension of it, it is possible to carry out multi-dimensional data mining at multi-abstract levels. Using interest context rules, we demonstrate the potentially significant meaning of this system prototype.