Proceedings of the 10th international conference on World Wide Web
SilkRoute: A framework for publishing relational data in XML
ACM Transactions on Database Systems (TODS)
Query processing of streamed XML data
Proceedings of the eleventh international conference on Information and knowledge management
Schema Mapping as Query Discovery
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Efficiently Publishing Relational Data as XML Documents
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
ACM Transactions on Database Systems (TODS)
NEXSORT: Sorting XML in External Memory
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
The Piazza Peer Data Management System
IEEE Transactions on Knowledge and Data Engineering
The BEA streaming XQuery processor
The VLDB Journal — The International Journal on Very Large Data Bases
The VLDB Journal — The International Journal on Very Large Data Bases
Supporting executable mappings in model management
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Extending XQuery for analytics
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Clio grows up: from research prototype to industrial tool
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
HePToX: marrying XML and heterogeneity in your P2P databases
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Data exchange: semantics and query answering
Theoretical Computer Science - Database theory
Nested mappings: schema mapping reloaded
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Mapping adaptation under evolving schemas
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
The NEXT framework for logical XQuery optimization
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Towards automatic generation of AXML web services for dynamic data integration
DataX '08 Proceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web
StreamTX: extracting tuples from streaming XML data
Proceedings of the VLDB Endowment
On keys, foreign keys and nullable attributes in relational mapping systems
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Clio: Schema Mapping Creation and Data Exchange
Conceptual Modeling: Foundations and Applications
Dissemination of heterogeneous XML data in publish/subscibe systems
Proceedings of the 18th ACM conference on Information and knowledge management
Towards the preservation of functional dependency in XML data transformation
International Journal of Intelligent Information and Database Systems
Semantic annotation of XML-schema for document transformations
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems
Predicting the maintainability of XSL transformations
Science of Computer Programming
s-XML: An efficient mapping scheme to bridge XML and relational database
Knowledge-Based Systems
TempoXML: Nested bitemporal relationship modeling and conversion tool for fuzzy XML
Information Sciences: an International Journal
Hi-index | 0.00 |
Clio is an existing schema-mapping tool that provides user-friendly means to manage and facilitate the complex task of transformation and integration of heterogeneous data such as XML over the Web or in XML databases. By means of mappings from source to target schemas, Clio can help users conveniently establish the precise semantics of data transformation and integration. In this paper we study the problem of how to efficiently implement such data transformation (i.e., generating target data from the source data based on schema mappings). We present a three-phase framework for high-performance XML-to-XML transformation based on schema mappings, and discuss methodologies and algorithms for implementing these phases. In particular, we elaborate on novel techniques such as streamed extraction of mapped source values and scalable disk-based merging of overlapping data (including duplicate elimination). We compare our transformation framework with alternative methods such as using XQuery or SQL/XML provided by current commercial databases. The results demonstrate that the three-phase framework (although as simple as it is) is highly scalable and outperforms the alternative methods by orders of magnitude.