Compilers: principles, techniques, and tools
Compilers: principles, techniques, and tools
A framework for supporting data integration using the materialized and virtual approaches
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Infomaster: an information integration system
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
The TSIMMIS Approach to Mediation: Data Models and Languages
Journal of Intelligent Information Systems - Special issue: next generation information technologies and systems
LDAP: programming directory-enabled applications with lightweight directory access protocol
LDAP: programming directory-enabled applications with lightweight directory access protocol
World Wide Web Journal - Special issue on XML: principles, tools, and techniques
An introduction to database systems (7th ed.)
An introduction to database systems (7th ed.)
Comparative analysis of five XML query languages
ACM SIGMOD Record
The Clio project: managing heterogeneity
ACM SIGMOD Record
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
XAS: a system for accessing componentized, virtual XML documents
ICSE '01 Proceedings of the 23rd International Conference on Software Engineering
The Jndi API Tutorial and Reference: Building Directory-Enabled Java Applications
The Jndi API Tutorial and Reference: Building Directory-Enabled Java Applications
SilkRoute: A framework for publishing relational data in XML
ACM Transactions on Database Systems (TODS)
Scaling Access to Heterogeneous Data Sources with DISCO
IEEE Transactions on Knowledge and Data Engineering
The Nimble XML Data Integration System
Proceedings of the 17th International Conference on Data Engineering
Proceedings of the 17th International Conference on Data Engineering
Tamino - A DBMS designed for XML
Proceedings of the 17th International Conference on Data Engineering
Efficiently Publishing Relational Data as XML Documents
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
XPERANTO: Middleware for Publishing Object-Relational Data as XML Documents
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Querying XML Views of Relational Data
Proceedings of the 27th International Conference on Very Large Data Bases
Quilt: An XML Query Language for Heterogeneous Data Sources
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Towards heterogeneous multimedia information systems: the Garlic approach
RIDE '95 Proceedings of the 5th International Workshop on Research Issues in Data Engineering-Distributed Object Management (RIDE-DOM'95)
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Oracle8i"The XML Enabled Data Management System
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
TREX: DTD-conforming XML to XML transformations
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
A Generic Load/Extract Utility for Data Transfer between XML Documents and Relational Databases
WECWIS '00 Proceedings of the Second International Workshop on Advance Issues of E-Commerce and Web-Based Information Systems (WECWIS 2000)
The JFC Swing Tutorial: A Guide to Constructing GUIs, Second Edition
The JFC Swing Tutorial: A Guide to Constructing GUIs, Second Edition
DTD-directed publishing with attribute translation grammars
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents
Information Sciences: an International Journal
International Journal of Advanced Pervasive and Ubiquitous Computing
Hi-index | 0.07 |
XML documents are becoming popular for business process integration. To achieve interoperability between applications, XML documents must also conform to various commonly used data type definitions (DTDs). However, most business data are not maintained as XML documents. They are stored in various native formats, such as database tables or LDAP directories. Hence, a middleware is needed to dynamically generate XML documents conforming to predefined DTDs from various data sources. As industrial consortia and large corporations have created various DTDs, it is both challenging and time-consuming to design the necessary middleware to conform to so many different DTDs. This problem is particularly acute for a small- or medium-sized enterprise because it lacks the IT skills to quickly develop such a middleware. In this paper, we present XLE, an XML Lightweight Extractor, as a practical approach to dynamically extracting DTD-conforming XML documents from heterogeneous data sources. XLE is based on a framework called DTD source annotation (DTDSA). It treats a DTD as the control structure of a program. The annotations become the program statements, such as functions and assignments. DTD-conforming XML documents are generated by parsing annotated DTDs. Basically, DTD annotations describe declaratively the mappings between target XML documents and the source data. The XLE engine implements a few basic annotations, providing a practical solution for many small- and medium-sized enterprises. However, XLE is designed to be versatile. It allows sophisticated users to plug in their own implementations to access new types of data or to achieve better performance. Heterogeneous data sources can be simply specified in the annotations. A GUI tool is provided to highlight the places where annotations are needed.