Measuring the structural similarity between two document-centric XML documents is a well-known problem. Many approaches have been proposed that, by fixing constraints on how similarity is evaluated, compute it in polynomial time. Despite this extensive activity on document-centric XML documents, no approaches have so far been presented for data-centric XML documents (i.e., XML documents for which the order among elements is not relevant), mainly because this problem is, in general, NP-complete. In this paper we propose an approach for measuring the structural similarity between two data-centric XML documents that runs in polynomial time when the structure of elements with the same label is not heterogeneous.