Conceptual-model-based data extraction from multiple-record Web pages
Data & Knowledge Engineering
Data-driven understanding and refinement of schema mappings
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Data integration: a theoretical perspective
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Schema Mapping as Query Discovery
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Metaheuristics in combinatorial optimization: Overview and conceptual comparison
ACM Computing Surveys (CSUR)
A survey on the use of relevance feedback for information access systems
The Knowledge Engineering Review
Data integration under integrity constraints
Information Systems - Special issue: The 14th international conference on advanced information systems engineering (CAiSE*02)
iMAP: discovering complex semantic matches between database schemas
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
How to Solve It: Modern Heuristics
How to Solve It: Modern Heuristics
Integrating Data from Disparate Sources: A Mass Collaboration Approach
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Schema mappings, data exchange, and metadata management
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Relational languages for metadata integration
ACM Transactions on Database Systems (TODS)
Mapping maintenance for data integration systems
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Data exchange: semantics and query answering
Theoretical Computer Science - Database theory
Composing schema mappings: Second-order dependencies to the rescue
ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2004
Principles of dataspace systems
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Data integration: the teenage years
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Debugging schema mappings with routes
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
SPIDER: a schema mapPIng DEbuggeR
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Why is schema matching tough and what can we do about it?
ACM SIGMOD Record
A composite approach to automating direct and indirect schema mappings
Information Systems
Communications of the ACM - ACM at sixty: a look back in time
Matching large schemas: Approaches and evaluation
Information Systems
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Building structured web community portals: a top-down, compositional, and incremental approach
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
iTrails: pay-as-you-go information integration in dataspaces
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Schema mapping verification: the spicy way
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Pay-as-you-go user feedback for dataspace systems
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Bootstrapping pay-as-you-go data integration systems
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
The Spicy system: towards a notion of mapping quality
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Learning to create data-integrating queries
Proceedings of the VLDB Endowment
Data integration with uncertainty
The VLDB Journal — The International Journal on Very Large Data Bases
Muse: Mapping Understanding and deSign by Example
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Aggregate Query Answering under Uncertain Schema Mappings
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Quarrying dataspaces: Schemaless profiling of unfamiliar information sources
ICDEW '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering Workshop
Cooperative update exchange in the Youtopia system
Proceedings of the VLDB Endowment
Feedback-driven result ranking and query refinement for exploring semi-structured data collections
Proceedings of the 13th International Conference on Extending Database Technology
Feedback-based annotation, selection and refinement of schema mappings for dataspaces
Proceedings of the 13th International Conference on Extending Database Technology
Characterizing schema mappings via data examples
Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Automatically incorporating new sources in keyword search-based data integration
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Schema clustering and retrieval for multi-domain pay-as-you-go data integration systems
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
MapMerge: correlating independent schema mappings
Proceedings of the VLDB Endowment
Designing and refining schema mappings via data examples
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Hi-index | 0.00 |
One aspect of the vision of dataspaces has been articulated as providing various benefits of classical data integration with reduced up-front costs. In this paper, we present techniques that aim to support schema mapping specification through interaction with end users in a pay-as-you-go fashion. In particular, we show how schema mappings, that are obtained automatically using existing matching and mapping generation techniques, can be annotated with metrics estimating their fitness to user requirements using feedback on query results obtained from end users. Using the annotations computed on the basis of user feedback, and given user requirements in terms of precision and recall, we present a method for selecting the set of mappings that produce results meeting the stated requirements. In doing so, we cast mapping selection as an optimization problem. Feedback may reveal that the quality of schema mappings is poor. We show how mapping annotations can be used to support the derivation of better quality mappings from existing mappings through refinement. An evolutionary algorithm is used to efficiently and effectively explore the large space of mappings that can be obtained through refinement. User feedback can also be used to annotate the results of the queries that the user poses against an integration schema. We show how estimates for precision and recall can be computed for such queries. We also investigate the problem of propagating feedback about the results of (integration) queries down to the mappings used to populate the base relations in the integration schema.