XEdge: clustering homogeneous and heterogeneous XML documents using edge summaries

  • Authors:
  • Panagiotis Antonellis;Christos Makris;Nikos Tsirakis

  • Affiliations:
  • University of Patras, Greece, Rio, Patras;University of Patras, Greece, Rio, Patras;University of Patras, Greece, Rio, Patras

  • Venue:
  • Proceedings of the 2008 ACM symposium on Applied computing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose a unified clustering algorithm for both homogeneous and heterogeneous XML documents. Depending on the type of the XML documents, the proposed algorithm modifies its distance metric in order to properly adapt to the special structural characteristics of homogeneous and heterogeneous XML documents. We compare the quality of the formed clusters with those of one of the latest XML clustering algorithms and show that our algorithm outperforms it in the case of both homogeneous and heterogeneous XML documents.