Integrating web conceptual modeling and web usage mining

  • Authors:
  • Rosa Meo;Pier Luca Lanzi;Maristella Matera;Roberto Esposito

  • Affiliations:
  • Università di Torino, Italy;Politecnico di Milano, Italy;Politecnico di Milano, Italy;Università di Torino, Italy

  • Venue:
  • WebKDD'04 Proceedings of the 6th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
  • Year:
  • 2004

Abstract

We present a case study on applying the inductive database approach to the analysis of Web logs. We consider rich XML Web logs, called conceptual logs, that are generated by Web applications designed with the WebML conceptual model and developed with the WebRatio CASE tool. Conceptual logs integrate the usual information about user requests with metadata describing the structure of the content and the hypertext of a Web application. We apply a data mining language (MINE RULE) to conceptual logs in order to identify different types of patterns, such as recurrent navigation paths, the most frequently visited page contents, and anomalies (e.g., intrusion attempts or harmful usage of resources). We show that exploiting the nuggets of information embedded in the logs, together with the specialized mining constructs provided by the query language, enables rapid customization of the mining procedures according to the Web developers' needs. Based on our field experience, we also suggest that using queries in advanced languages, as opposed to ad hoc heuristics, eases the specification and discovery of a large spectrum of patterns.
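To make the idea of mining co-visited pages concrete, the sketch below computes simple association rules between pages seen in the same session, filtered by support and confidence thresholds. It is plain Python used as a stand-in, not MINE RULE syntax and not the authors' procedure. The log records, the page names, and the function `mine_page_rules` are hypothetical; real conceptual logs are XML and carry much richer metadata than a (session, page) pair.

```python
from collections import defaultdict
from itertools import permutations

# Hypothetical, heavily simplified log records: (session_id, page_id).
# Assumption for illustration only; not the conceptual log schema.
LOG = [
    ("s1", "home"), ("s1", "catalog"), ("s1", "product"),
    ("s2", "home"), ("s2", "catalog"),
    ("s3", "home"), ("s3", "product"),
    ("s4", "catalog"), ("s4", "product"),
]


def mine_page_rules(log, min_support, min_confidence):
    """Return sorted rules (body, head, support, confidence).

    A rule body => head means: sessions that visit `body` also visit `head`.
    support    = sessions containing both pages / all sessions
    confidence = sessions containing both pages / sessions containing body
    """
    # Group the distinct pages visited in each session.
    pages_by_session = defaultdict(set)
    for session_id, page_id in log:
        pages_by_session[session_id].add(page_id)

    n_sessions = len(pages_by_session)
    page_count = defaultdict(int)   # sessions containing a page
    pair_count = defaultdict(int)   # sessions containing an ordered pair
    for pages in pages_by_session.values():
        for page in pages:
            page_count[page] += 1
        for body, head in permutations(sorted(pages), 2):
            pair_count[(body, head)] += 1

    rules = []
    for (body, head), count in pair_count.items():
        support = count / n_sessions
        confidence = count / page_count[body]
        if support >= min_support and confidence >= min_confidence:
            rules.append((body, head, support, confidence))
    return sorted(rules)


RULES = mine_page_rules(LOG, min_support=0.5, min_confidence=0.6)
for body, head, support, confidence in RULES:
    print(f"{body} => {head}  support={support:.2f}  confidence={confidence:.2f}")
```

Raising `min_support` or `min_confidence` narrows the result to stronger rules. In a declarative mining language these thresholds, along with the grouping attribute (here the session), would be stated in the query itself rather than coded by hand.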