Improving web sites with web usage mining, web content mining, and semantic analysis

  • Authors:
  • Jean-Pierre Norguet;Esteban Zimányi;Ralf Steinberger

  • Affiliations:
  • Department of Computer & Network Engineering, Université Libre de Bruxelles, Brussels, Belgium;Department of Computer & Network Engineering, Université Libre de Bruxelles, Brussels, Belgium;Joint Research Centre, European Commission, Ispra, (VA), Italy

  • Venue:
  • SOFSEM'06 Proceedings of the 32nd conference on Current Trends in Theory and Practice of Computer Science
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the emergence of the World Wide Web, Web sites have become a key communication channel for organizations. In this context, analyzing and improving Web communication is essential to better satisfy the objectives of the target audience. Web communication analysis is traditionnally performed by Web analytics software, which produce long lists of audience metrics. These metrics contain little semantics and are too detailed to be exploited by organization managers and chief editors, who need summarized and conceptual information to take decisions. Our solution to obtain such conceptual metrics is to analyze the content of the Web pages output by the Web server. In this paper, we first present a list of methods that we conceived to mine the output Web pages. Then, we explain how term weights in these pages can be used as audience metrics, and how they can be aggregated using OLAP tools to obtain concept-based metrics. Finally, we present the concept-based metrics that we obtained with our prototype WASA and SQL Server OLAP tools.