A reasonable rough approximation for clustering web users

  • Authors:
  • Duoqian Miao;Min Chen;Zhihua Wei;Qiguo Duan

  • Affiliations:
  • Department of Computer Science and Technology, Tongji University, Shanghai, China and The Key Laboratory of Embedded System and Service Computing, Ministry of Education, China;Department of Computer Science and Technology, Tongji University, Shanghai, China and The Key Laboratory of Embedded System and Service Computing, Ministry of Education, China;Department of Computer Science and Technology, Tongji University, Shanghai, China and The Key Laboratory of Embedded System and Service Computing, Ministry of Education, China;Department of Computer Science and Technology, Tongji University, Shanghai, China and The Key Laboratory of Embedded System and Service Computing, Ministry of Education, China

  • Venue:
  • WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Due to the uncertainty in accessing Web pages, analysis of Web logs faces some challenges. Several rough k-means cluster algorithms have been proposed and successfully applied to Web usage mining. However, they did not explain why rough approximations of these cluster algorithms were introduced. This paper analyzes the characteristics of the data in the boundary areas of clusters, and then a rough k-means cluster algorithm based on a reasonable rough approximation (RKMrra) is proposed. Finally RKMrra is applied to Web access logs. In the experiments RKMrra compares to Lingras and West algorithm and Peters algorithm with respect to five characteristics. The results show that RKMrra discovers meaningful clusters of Web users and its rough approximation is more reasonable.