Discovering XML keys and foreign keys in queries
Proceedings of the 2009 ACM symposium on Applied Computing
Spoilt for Choice: Full First-Order Hierarchical Decompositions
WoLLIC '09 Proceedings of the 16th International Workshop on Logic, Language, Information and Computation
Exploring XML web collections with DescribeX
ACM Transactions on the Web (TWEB)
Detecting XML functional dependencies through formal concept analysis
ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
Solving the implication problem for XML functional dependencies with properties
WoLLIC'10 Proceedings of the 17th international conference on Logic, language, information and computation
Fast detection of functional dependencies in XML data
XSym'10 Proceedings of the 7th international XML database conference on Database and XML technologies
Design non-recursive and redundant-free XML conceptual schema with hypergraph
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications
Attribute grammar for XML integrity constraint validation
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
A workload-aware approach for optimizing the XML schema design trade-off
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
Design by example for SQL table definitions with functional dependencies
The VLDB Journal — The International Journal on Very Large Data Bases
On the existence of armstrong data trees for XML functional dependencies
FoIKS'10 Proceedings of the 6th international conference on Foundations of Information and Knowledge Systems
On Inferences of Full First-Order Hierarchical Decompositions
Fundamenta Informaticae - Logic, Language, Information and Computation
XML data exchange with target constraints
Information Processing and Management: an International Journal
Formal Framework of XML Document Schema Design
International Journal of Information Retrieval Research
XML Documents Normalization Using GN-DTD
International Journal of Information Retrieval Research
Discovering conditional functional dependencies in XML data
ADC '11 Proceedings of the Twenty-Second Australasian Database Conference - Volume 115
Discovering XSD keys from XML data
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
XML normalization based on entity segments
Information Sciences: an International Journal
International Journal of Information Technology and Web Engineering
Hi-index | 0.00 |
As XML becomes increasingly popular, XML schema design has become an increasingly important issue. One of the central objectives of good schema design is to avoid data redundancies: redundantly stored information can lead not just only to a higher data storage cost but also to increased costs for data transfer and data manipulation. Furthermore, such data redundancies can lead to potential update anomalies, rendering the database inconsistent. One strategy to avoid data redundancies is to design redundancy-free schema from the start on the basis of known functional dependencies. We observe that XML databases are often "casually designed" and XML FDs may not be determined in advance. Under such circumstances, discovering XML data redundancies from the data itself becomes necessary and is an integral part of the schema refinement (or re-design) process. We present the design and implementation of the first system, DiscoverXFD, for efficient discovery of XML data redundancies. It employs a novel XML data structure and introduces a new class of partition-based algorithms. The XML data redundancies are defined on the basis of a new notion of XML functional dependency (XML FD) that (1) extends previous notions by incorporating set elements into the XML FD specification, and (2) maintains tuple-based semantics through the novel concept of Generalized Tree Tuple (GTT). Using this comprehensive XML FD notion, we introduce a new normal form (GTT-XNF) for XML documents, and provide comprehensive comparisons with previous studies. Given the set of data redundancies (in the form of redundancy-indicating XML FDs) discovered by DiscoverXFD, we describe a normalization algorithm for converting any original XML schema into one in GTT-XNF.