Current methods for data integration are as difficult to use as they are powerful. Motivated by our work with clinical data and the people who analyze it, we present two components that allow domain experts without technical training to create and reuse complex data integration processes. The GUAVA (GUI As View Apparatus) component enables data analysts to make informed data integration decisions based on detailed accounts of the user interface that was used to generate the data. The MultiClass component allows analysts to revisit decisions made for prior studies and choose, each time the data is used, whether to reuse them. We illustrate both components with examples in which a warehouse of clinical data supports research studies. We also describe the state of our implementation and explain why we believe the two components can be automatically translated into ETL workflows.