Improving the web usage analysis process: a UML model of the ETL process

  • Authors:
  • Thilo Maier

  • Affiliations:
  • Catholic University Eichstätt-Ingolstadt, Ingolstadt, Germany

  • Venue:
  • WebKDD'04 Proceedings of the 6th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Integrating OLAP and Web usage analysis in a data warehousing environment is a promising approach for sophisticated analysis of the Web channel in multi-channel environments of organizations. Populating the data warehouse is a laborious and time-consuming task (especially for small projects), which is – in practice – a big obstacle for concrete ECRM projects. Especially if Web usage analysis researchers need to conduct experiments with a Web warehouse, an intuitive and easy to deploy ETL component is essential. In this paper we propose a logical object-oriented relational data storage model in UML, which is based on a formal model. A concrete Java instance of our model simplifies modeling and automating the ETL process. The Java instance of our model has been integrated into our WUSAN (Web USage ANalyis) system. Finally, we illustrate the usage of our model for Web usage analysis purposes, though the model is principally not restricted to this domain.