Active User-Based and Ontology-Based Web Log Data Preprocessing for Web Usage Mining

  • Authors:
  • Natheer Khasawneh; Chien-Chung Chan

  • Affiliations:
  • Jordan University of Science and Technology, Jordan; University of Akron, USA

  • Venue:
  • WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
  • Year:
  • 2006

Abstract

User identification and session identification are two major steps in preprocessing web log data for web usage mining. This paper introduces a fast, active user-based user identification algorithm with time complexity O(n). The algorithm uses both the IP address and a finite user inactivity time to distinguish users in the web log. A website ontology is useful for identifying the structure of a website and the break points in browsing behavior. For session identification, we present an ontology-based method that uses the website's structure and functionality to identify different sessions.
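
The abstract does not give the algorithm itself, so the following is only a minimal sketch of how a single-pass user identification based on IP address and an inactivity threshold might look. It is not the authors' implementation. The names `LogEntry`, `identify_users`, and `max_inactive`, the 30-minute default, and the assumption that log entries arrive sorted by timestamp are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class LogEntry:
    """One parsed web log record (hypothetical minimal schema)."""
    ip: str
    timestamp: float  # seconds, assumed non-decreasing across the log
    url: str


def identify_users(entries: List[LogEntry], max_inactive: float = 1800.0) -> List[int]:
    """Label each entry with a user id in one pass over the log.

    Assumption: a request from an IP that has been silent for more than
    `max_inactive` seconds is treated as coming from a new user.
    """
    # Active users keyed by IP: ip -> (user id, time of last request).
    active: Dict[str, Tuple[int, float]] = {}
    next_user_id = 0
    user_ids: List[int] = []

    for entry in entries:
        current = active.get(entry.ip)
        if current is None or entry.timestamp - current[1] > max_inactive:
            # Unknown IP, or the previous user on this IP has timed out.
            user_id = next_user_id
            next_user_id += 1
        else:
            user_id = current[0]
        # Refresh the last-seen time for this IP.
        active[entry.ip] = (user_id, entry.timestamp)
        user_ids.append(user_id)

    return user_ids


log = [
    LogEntry("10.0.0.1", 0.0, "/"),
    LogEntry("10.0.0.2", 10.0, "/"),
    LogEntry("10.0.0.1", 100.0, "/products"),
    LogEntry("10.0.0.1", 5000.0, "/"),
]
labels = identify_users(log)
```

Each entry triggers one dictionary lookup and one update, so the pass takes expected linear time in the number of log entries, which is consistent with the O(n) bound stated in the abstract. In the example, the last request from `10.0.0.1` arrives after more than 1800 seconds of silence and is therefore labeled as a new user.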