One-unambiguous regular languages
Information and Computation
DTD inference for views of XML data
PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Everything You Ever Wanted to Know About DTDs, But Were Afraid to Ask (Extended Abstract)
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Taxonomy of XML schema languages using formal language theory
ACM Transactions on Internet Technology (TOIT)
Expressiveness of XSDs: from practice to theory, there and back again
WWW '05 Proceedings of the 14th international conference on World Wide Web
Impact of XML schema evolution on valid documents
Proceedings of the 7th annual ACM international workshop on Web information and data management
Inference of concise DTDs from XML data
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Expressiveness and complexity of XML Schema
ACM Transactions on Database Systems (TODS)
Journal of Computer and System Sciences
ACM SIGMOD Record
An edit operation-based approach to the inclusion problem for DTDs
Proceedings of the 2007 ACM symposium on Applied computing
Simple off the shelf abstractions for XML schema
ACM SIGMOD Record
Inferring XML schema definitions from XML data
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Learning deterministic regular expressions for the inference of schemas from XML data
Proceedings of the 17th international conference on World Wide Web
SchemaScope: a system for inferring and cleaning XML schemas
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
FLUX: functional updates for XML
Proceedings of the 13th ACM SIGPLAN international conference on Functional programming
Inclusion Test Algorithms for One-Unambiguous Regular Expressions
Proceedings of the 5th international colloquium on Theoretical Aspects of Computing
Linear time membership in a class of regular expressions with interleaving and counting
Proceedings of the 17th ACM conference on Information and knowledge management
Discovering XML keys and foreign keys in queries
Proceedings of the 2009 ACM symposium on Applied Computing
Towards inference of more realistic XSDs
Proceedings of the 2009 ACM symposium on Applied Computing
An X-ray on web-available XML schemas
ACM SIGMOD Record
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Simplifying XML schema: effortless handling of nondeterministic regular expressions
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Efficient inclusion for a class of XML types with interleaving and counting
Information Systems
Detection of corrupted schema mappings in XML data integration systems
ACM Transactions on Internet Technology (TOIT)
An Automata-Theoretic Approach to Regular XPath
DBPL '09 Proceedings of the 12th International Symposium on Database Programming Languages
Efficient inclusion checking for deterministic tree automata and XML Schemas
Information and Computation
Inference of concise regular expressions and DTDs
ACM Transactions on Database Systems (TODS)
Efficient inclusion for a class of XML types with interleaving and counting
DBPL'07 Proceedings of the 11th international conference on Database programming languages
DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
On the tradeoff between mapping and querying power in XML data exchange
Proceedings of the 13th International Conference on Database Theory
XML: some papers in a haystack
ACM SIGMOD Record
Learning Deterministic Regular Expressions for the Inference of Schemas from XML Data
ACM Transactions on the Web (TWEB)
On inference of XML schema with the knowledge of an obsolete one
ADC '09 Proceedings of the Twentieth Australasian Conference on Australasian Database - Volume 92
XML schema computations: schema compatibility testing and subschema extraction
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Analyzer: a framework for file analysis
DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
Subtyping algorithm of regular tree grammars with disjoint production rules
ICTAC'10 Proceedings of the 7th International colloquium conference on Theoretical aspects of computing
Quality assessment of MAGE-ML genomic datasets using DescribeX
DILS'10 Proceedings of the 7th international conference on Data integration in the life sciences
Checking determinism of XML Schema content models in optimal time
Information Systems
Complexity of Decision Problems for XML Schemas and Chain Regular Expressions
SIAM Journal on Computing
Generating schema mappings based on annotations in a P2P data integration system
Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
Assisting the design of XML schema: diagnosing nondeterministic content models
APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
Proceedings of the 20th ACM international conference on Information and knowledge management
Optimizing schema languages for XML: numerical constraints and interleaving
ICDT'07 Proceedings of the 11th international conference on Database Theory
X-Evolution: a system for XML schema evolution and document adaptation
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Towards an XML representation of proper names and their relationships
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Axiomatising functional dependencies for XML with frequencies
FoIKS'06 Proceedings of the 4th international conference on Foundations of Information and Knowledge Systems
Efficient incremental validation of XML documents after composite updates
XSym'06 Proceedings of the 4th international conference on Database and XML Technologies
A quantitative summary of XML structures
ER'06 Proceedings of the 25th international conference on Conceptual Modeling
Deterministic regular expressions in linear time
PODS '12 Proceedings of the 31st symposium on Principles of Database Systems
Foundations of XML based on logic and automata: a snapshot
FoIKS'12 Proceedings of the 7th international conference on Foundations of Information and Knowledge Systems
Developing and analyzing XSDs through BonXai
Proceedings of the VLDB Endowment
Between tree patterns and conjunctive queries: is there tractability beyond acyclicity?
MFCS'12 Proceedings of the 37th international conference on Mathematical Foundations of Computer Science
XPath query satisfiability is in PTIME for real-world DTDs
XSym'07 Proceedings of the 5th international conference on Database and XML Technologies
XML schema evolution: incremental validation and efficient document adaptation
XSym'07 Proceedings of the 5th international conference on Database and XML Technologies
Web Semantics: Science, Services and Agents on the World Wide Web
Almost-linear inclusion for XML regular expression types
ACM Transactions on Database Systems (TODS)
Hi-index | 0.00 |
Among the various proposals answering the shortcomings of Document Type Definitions (DTDs), XML Schema is the most widely used. Although DTDs and XML Schema Definitions (XSDs) differ syntactically, they are still quite related on an abstract level. Indeed, freed from all syntactic sugar, XML Schemas can be seen as an extension of DTDs with a restricted form of specialization. In the present paper, we inspect a number of DTDs and XSDs harvested from the web and try to answer the following questions: (1) which of the extra features/expressiveness of XML Schema not allowed by DTDs are effectively used in practice; and, (2) how sophisticated are the structural properties (i.e. the nature of regular expressions) of the two formalisms. It turns out that at present real-world XSDs only sparingly use the new features introduced by XML Schema: on a structural level the vast majority of them can already be defined by DTDs. Further, we introduce a class of simple regular expressions and obtain that a surprisingly high fraction of the content models belong to this class. The latter result sheds light on the justification of simplifying assumptions that sometimes have to be made in XML research.