An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Comparing Hierarchical Data in External Memory
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
XEdge: clustering homogeneous and heterogeneous XML documents using edge summaries
Proceedings of the 2008 ACM symposium on Applied computing
Structural similarity evaluation between XML documents and DTDs
WISE'07 Proceedings of the 8th international conference on Web information systems engineering
XCLS: a fast and effective clustering algorithm for heterogenous XML documents
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Web Semantics: Science, Services and Agents on the World Wide Web
Hi-index | 0.00 |
XML has experimented a rapid growth mostly because of its application on the Web. Application varies from version control management, data storage to clustering and information retrieval. In this context, it is necessary to develop efficient techniques for comparing XML documents. Many method proposed are based only on structural commonalities, ignoring semantics. In this paper, we propose a new method for comparing XML documents based on LevelEdge combining tag structural and semantic similarities.