XML documents clustering based on representative path

  • Authors:
  • Woosaeng Kim

  • Affiliations:
  • Department of Computer Science, Kwangwoon University, Nowon-Gu, Seoul, Korea

  • Venue:
  • ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

XML is increasingly important in data exchange and information management. A large amount of efforts have been spent in developing efficient techniques for accessing, querying, and storing XML documents. In this paper, we propose a new method to cluster XML documents efficiently. A new prepresentative path called a virtul path which can represent both the structure and the contents of a XML document is proposed for the feature of a XML document. A method to apply the well known hierarchical clustering techniques to the representative paths to cluster XML documents is also proposed. The experiment shows that the true clusters are formed in a compact shape when a virtual path is used for the feature of a XML document.