A hybrid approach using pso and K-means for semantic clustering of web documents

  • Authors:
  • J. Avanija;K. Ramar

  • Affiliations:
  • Velammal College of Engineering & Technology, Tamilnadu, India;Einstein College of Engineering, Tamilnadu, India

  • Venue:
  • Journal of Web Engineering
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the massive growth and large volume of the web it is very difficult to recover results based on the user preferences. The next generation web architecture, semantic web reduces the burden of the user by performing search based on semantics instead of keywords. Even in the context of semantic technologies optimization problem occurs but rarely considered. In this paper Document clustering is applied to recover relevant documents. We propose a ontology based clustering algorithm using semantic similarity measure and Particle Swarm Optimization(PSO), which is applied to the annotated documents for optimizing the result. The proposed method uses Jena API and GATE tool API and the documents can be recovered based on their annotation features and relations. A preliminary experiment comparing the proposed method with K-Means shows that the proposed method is feasible and performs better than K-Means.