Normalization theory for XML

Authors:
Leonid Libkin
Affiliations:
School of Informatics, University of Edinburgh
Venue:
XSym'07 Proceedings of the 5th international conference on Database and XML Technologies
Year:
2007

Citing 17
Cited 5

An Infornation-Theoretic Analysis of Relational Databases Part I: Data Dependencies and Information Metric

IEEE Transactions on Software Engineering
A new normal form for the design of relational database schemata

ACM Transactions on Database Systems (TODS)
Synthesizing third normal form relations from functional dependencies

ACM Transactions on Database Systems (TODS)
Multivalued dependencies and a new normal form for relational databases

ACM Transactions on Database Systems (TODS)
Information dependencies

PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Keys for XML

Proceedings of the 10th international conference on World Wide Web
On XML integrity constraints in the presence of DTDs

Journal of the ACM (JACM)
Relational Databases for Querying XML Documents: Limitations and Opportunities

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Developing XML Documents with Guaranteed ``Good'' Properties

ER '01 Proceedings of the 20th International Conference on Conceptual Modeling: Conceptual Modeling
A normal form for XML documents

ACM Transactions on Database Systems (TODS)
Strong functional dependencies and their application to normal forms in XML

ACM Transactions on Database Systems (TODS)
An information-theoretic approach to normal forms for relational and XML data

Journal of the ACM (JACM)
Removing XML data redundancies using functional and equality-generating dependencies

ADC '05 Proceedings of the 16th Australasian database conference - Volume 39
On redundancy vs dependency preservation in normalization: an information-theoretic study of 3NF

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Dependency-preserving normalization of relational and XML data

Journal of Computer and System Sciences
XML design for relational storage

Proceedings of the 16th international conference on World Wide Web
Multivalued dependencies and a 4NF for XML

CAiSE'03 Proceedings of the 15th international conference on Advanced information systems engineering

Modeling and Querying E-Commerce Data in Hybrid Relational-XML DBMSs

ER '08 Proceedings of the 27th International Conference on Conceptual Modeling
Extracting a largest redundancy-free XML storage structure from an acyclic hypergraph in polynomial time

Information Systems
Design non-recursive and redundant-free XML conceptual schema with hypergraph

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications
Formal Framework of XML Document Schema Design

International Journal of Information Retrieval Research
Generating the fewest redundancy-free scheme trees from acyclic conceptual-model hypergraphs in polynomial time

Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Specifications of XML documents typically consist of typing information (e.g., a DTD), and integrity constraints. Just like relational schema specifications, not all are good - some are prone to redundancies and update anomalies. In the relational world we have a well-developed theory of data design (also known as normalization). A few definitions of XML normal forms have been proposed, but the main question is why a particular design is good. In the XML world, we still lack universally accepted query languages such as relational algebra, or update languages that let us reason about storage redundancies, lossless decompositions, and update anomalies. A better approach, therefore, is to come up with notions of good design based on the intrinsic properties of the model itself. We present such an approach, based on Shannon's information theory, and show how it applies to relational normal forms as well as to XML design, for both native and relational storage.