Generating representative Web workloads for network and server performance evaluation
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Hi-index | 0.00 |
While the whole public web is a potential source for web content and web structure mining, the actual usage information, that is essential for web usage mining (WUM), is kept hidden by web servers of hosted websites. Furthermore, there are only a handful of poorly described web access datasets publicly available. On the one hand, the lack of public datasets hamper WUM research, while on the other hand, online services demand for advanced techniques, e.g. to profile their customers and personalise their web based services. In this paper we propose our methodology to build synthetic web usage data generators based on the knowledge established by an extensive analysis of five real-world web usage datasets.