On Modelling and Synthetically Generating Web Usage Data

  • Authors:
  • Peter I. Hofgesang;Jan Peter Patist

  • Affiliations:
  • -;-

  • Venue:
  • WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

While the whole public web is a potential source for web content and web structure mining, the actual usage information, that is essential for web usage mining (WUM), is kept hidden by web servers of hosted websites. Furthermore, there are only a handful of poorly described web access datasets publicly available. On the one hand, the lack of public datasets hamper WUM research, while on the other hand, online services demand for advanced techniques, e.g. to profile their customers and personalise their web based services. In this paper we propose our methodology to build synthetic web usage data generators based on the knowledge established by an extensive analysis of five real-world web usage datasets.