A method for personalized clustering in data intensive web applications

  • Authors:
  • Maria Rigou;Spiros Sirmakessis;Giannis Tzimas

  • Affiliations:
  • University of Patras;TEI of Messolonghi;Patras University

  • Venue:
  • Proceedings of the joint international workshop on Adaptivity, personalization & the semantic web
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper introduces an algorithm for personalized clustering based on a range tree structure, used for identifying all web documents satisfying a set of predefined personal user preferences. The returned documents go through a clustering phase before reaching the end user, thus allowing more effective manipulation and supporting the decision making process. The proposed algorithm demonstrates increased applicability in semantic web settings, since they offer the infrastructure for the explicit declaration of web document attributes and their respective values, thus allowing for more automated retrieval. The proposed algorithm improves the k-means range algorithm, as it uses the already constructed range tree (i.e. during the personalized filtering phase) as the basic structure on which the clustering step is based, applying instead of the k-means, the k-windows algorithm. The total number of parameters used for modeling the web documents dictates the number of dimensions of the Euclidean space representation. The time complexity of the algorithm is O(logd-2n+v), where d is the number of dimensions, n is the total number of web documents and v is the size of the answer.