A normal form for relational databases that is based on domains and keys
ACM Transactions on Database Systems (TODS)
On verifying consistency of XML specifications
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
On XML integrity constraints in the presence of DTDs
Journal of the ACM (JACM)
Justification for Inclusion Dependency Normal Form
IEEE Transactions on Knowledge and Data Engineering
Journal of Computer and System Sciences - Special issue on PODS 2000
A normal form for XML documents
ACM Transactions on Database Systems (TODS)
Strong functional dependencies and their application to normal forms in XML
ACM Transactions on Database Systems (TODS)
An information-theoretic approach to normal forms for relational and XML data
Journal of the ACM (JACM)
Design principles for xml data
Design principles for xml data
Propagating XML constraints to relations
Journal of Computer and System Sciences
ACM SIGMOD Record
Dependency-preserving normalization of relational and XML data
Journal of Computer and System Sciences
XML schema refinement through redundancy detection and normalization
The VLDB Journal — The International Journal on Very Large Data Bases
CITWORKSHOPS '08 Proceedings of the 2008 IEEE 8th International Conference on Computer and Information Technology Workshops
On Defining Functional Dependency for XML
ICSC '09 Proceedings of the 2009 IEEE International Conference on Semantic Computing
Keys in XML: Capturing Identification and Uniqueness
WISE '09 Proceedings of the 10th International Conference on Web Information Systems Engineering
Structural and semantic aspects of similarity of Document Type Definitions and XML schemas
Information Sciences: an International Journal
Multivalued dependencies and a 4NF for XML
CAiSE'03 Proceedings of the 15th international conference on Advanced information systems engineering
Element similarity measures in XML schema matching
Information Sciences: an International Journal
Hi-index | 0.07 |
Compared with relational data, it is more difficult to normalize XML data. In the relational data model, semantically relevant attributes compose relations which can simplify the normalization issue. But limited by the structural characteristics, the semantic relevancies of XML data cannot be outlined explicitly. Therefore, in the existing XML normalization proposals, XML constraints hold in the unsuitable ranges and cannot authentically match the original information relevancies. In this paper, a kind of semantically relevant information sets- entity segments are used to limit the ranges where XML constraints hold. Based on entity segments, XML constraints are defined as XML attribute dependencies which can authentically reflect the original information relevancies. Simultaneously, entity primary keys are defined as the unique identifiers of entity segments, and the relationships among different entity segments are denoted by the concept of entity foreign key. To form a normalization system for XML schema design, the XML integrity rules and the XML normal form are proposed, the effect of the XML integrity rules and the XML normal form on normalizing XML data is shown by practical instances. And the information-theoretic measure is used to justify their roles further. It is concluded that entity segments are the suitable ranges where XML constraints can authentically match original information relevancies and the proposal presented in this paper is not only effective on avoiding XML data redundancies but also on keeping XML data consistencies.