IEEE Transactions on Software Engineering
Elements of information theory
Elements of information theory
Storing semistructured data with STORED
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A new normal form for the design of relational database schemata
ACM Transactions on Database Systems (TODS)
Synthesizing third normal form relations from functional dependencies
ACM Transactions on Database Systems (TODS)
PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Proceedings of the 10th international conference on World Wide Web
XRel: a path-based approach to storage and retrieval of XML documents using relational databases
ACM Transactions on Internet Technology (TOIT)
On verifying consistency of XML specifications
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Foundations of Databases: The Logical Level
Foundations of Databases: The Logical Level
A Guided Tour of Relational Databases and Beyond
A Guided Tour of Relational Databases and Beyond
Storing and querying ordered XML using a relational database system
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Relational Databases for Querying XML Documents: Limitations and Opportunities
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
The Theory of Probabilistic Databases
VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
Efficient Relational Storage and Retrieval of XML Documents
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Achievements of Relational Database Schema Design Theory Revisited
Selected Papers from a Workshop on Semantics in Databases
Developing XML Documents with Guaranteed ``Good'' Properties
ER '01 Proceedings of the 20th International Conference on Conceptual Modeling: Conceptual Modeling
Why is the snowflake schema a good data warehouse design?
Information Systems
A normal form for XML documents
ACM Transactions on Database Systems (TODS)
Strong functional dependencies and their application to normal forms in XML
ACM Transactions on Database Systems (TODS)
Database Systems: An Application Oriented Approach, Complete Version (2nd Edition)
Database Systems: An Application Oriented Approach, Complete Version (2nd Edition)
An information-theoretic approach to normal forms for relational and XML data
Journal of the ACM (JACM)
Native Xquery processing in oracle XMLDB
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Removing XML data redundancies using functional and equality-generating dependencies
ADC '05 Proceedings of the 16th Australasian database conference - Volume 39
Designing information-preserving mapping schemes for XML
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Native XML support in DB2 universal database
VLDB '05 Proceedings of the 31st international conference on Very large data bases
On redundancy vs dependency preservation in normalization: an information-theoretic study of 3NF
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A linear time algorithm for optimal tree sibling partitioning and approximation algorithms in Natix
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
MARS: a system for publishing XML from mixed and redundant storage
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Functional dependencies for XML
APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
Editorial: BioDB: An ontology-enhanced information system for heterogeneous biological information
Data & Knowledge Engineering
The implication problem for 'closest node' functional dependencies in complete XML documents
Journal of Computer and System Sciences
XSym'07 Proceedings of the 5th international conference on Database and XML Technologies
Hi-index | 0.00 |
Design principles for XML schemas that eliminate redundancies and avoid update anomalies have been studied recently. Several normal forms, generalizing those for relational databases, have been proposed. All of them, however, are based on the assumption of anative XML storage, while in practice most of XML data is stored inrelational databases. In this paper we study XML design and normalization for relational storage of XML documents. To be able to relate and compare XML and relational designs, we use an information-theoretic framework that measures information content in relations and documents, with higher values corresponding to lower levels of redundancy. We show that most common relational storage schemes preserve the notion of being well-designed (i.e., anomalies- and redundancy-free). Thus,existing XML normal forms guarantee well-designed relational storagesas well. We further show that if this perfect option is not achievable, then a slight restriction on XML constraints guarantees a "second-best" relational design, according to possible values of the information-theoretic measure. We finally consider an edge-based relational representation of XML documents, and show that while it has similar information-theoretic properties with other relational representations, it can behave significantly worse in terms of enforcing integrity constraints.