Relational Databases for Querying XML Documents: Limitations and Opportunities
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Evolving a Set of DTDs According to a Dynamic Set of XML Documents
EDBT '02 Proceedings of the Worshops XMLDM, MDDE, and YRWS on XML-Based Data Management and Multimedia Engineering-Revised Papers
XEM: Managing the Evolution of XML Documents
RIDE '01 Proceedings of the 11th International Workshop on research Issues in Data Engineering
Impact of XML schema evolution on valid documents
Proceedings of the 7th annual ACM international workshop on Web information and data management
Detecting changes on unordered XML documents using relational databases: a schema-conscious approach
Proceedings of the 14th ACM international conference on Information and knowledge management
DTD-Diff: A change detection algorithm for DTDs
Data & Knowledge Engineering
XANDY: detecting changes on large unordered XML documents using relationalDatabases
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
How to break MD5 and other hash functions
EUROCRYPT'05 Proceedings of the 24th annual international conference on Theory and Applications of Cryptographic Techniques
DTD-Diff: A change detection algorithm for DTDs
Data & Knowledge Engineering
Extensible User-Based XML Grammar Matching
ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
Using visual pages analysis for optimizing web archiving
Proceedings of the 2010 EDBT/ICDT Workshops
Vi-DIFF: understanding web pages changes
DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Edit distance between XML and probabilistic XML documents
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
E-Metadata versioning system for data warehouse schema
International Journal of Metadata, Semantics and Ontologies
Extracting differences between regular tree grammars
Proceedings of the 28th Annual ACM Symposium on Applied Computing
E-Metadata versioning system for data warehouse schema
International Journal of Metadata, Semantics and Ontologies
Hi-index | 0.00 |
The DTD of a set of XML documents may change due to many reasons such as changes to the real-world events, changes to the user's requirements, and mistakes in the initial design. In this paper, we present a novel algorithm called DTD-Diff to detect the changes to DTDs that defines the structure of a set of XML documents. Such change detection tool can be useful in several ways such as maintenance of XML documents, incremental maintenance of relational schema for storing XML data, and XML schema integration. We compare DTD-Diff with existing XML change detection approaches and show that converting DTD to XML schema (XSD) (which is in XML document format) and detecting the changes using existing XML change detection algorithms is not a feasible option. Our experimental results show that DTD-Diff is 5-325 times faster than X-Diff when it detects the changes to the XSD files. Compared to XyDiff, DTD-Diff is up to 38 times faster. We also study the result quality of detected deltas.