On the editing distance between unordered labeled trees
Information Processing Letters
Algorithm 479: A minimal spanning tree clustering method
Communications of the ACM
Introduction to Algorithms
XClust: clustering XML schemas for effective integration
Proceedings of the eleventh international conference on Information and knowledge management
A bag of paths model for measuring structural similarity in Web documents
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Information Systems - Special issue on web data integration
Measuring similarity of semi-structured documents with context weights
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Fuzzy Similarity from Conceptual Relations
APSCC '06 Proceedings of the 2006 IEEE Asia-Pacific Conference on Services Computing
XML schema clustering with semantic and hierarchical similarity measures
Knowledge-Based Systems
Investigating Semantic Measures in XML Clustering
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
An approach to XML path matching
Proceedings of the 9th annual ACM international workshop on Web information and data management
Introduction to Information Retrieval
Introduction to Information Retrieval
A methodology for clustering XML documents by structure
Information Systems
Hi-index | 0.00 |
Clustering XML documents semantically has become a major challenge in XML data managements. The key research issue is to find the similarity functions of XML documents. However, previous work gave more importance to the topology structure than to the semantic information. In this paper, the computation of similarity between two XML documents is based on both structural and semantic information. Then a minimal spanning tree clustering method is used to cluster XML documents. The experiment results show that the new method performs better than baseline similarity measure in terms of purity and rand index.