A comparative analysis of methodologies for database schema integration
ACM Computing Surveys (CSUR)
Federated database systems for managing distributed, heterogeneous, and autonomous databases
ACM Computing Surveys (CSUR) - Special issue on heterogeneous databases
Exploring the similarity space
ACM SIGIR Forum
Semantic integration of heterogeneous information sources
Data & Knowledge Engineering - Special issue on heterogeneous information resources need semantic access
The Clio project: managing heterogeneity
ACM SIGMOD Record
Reconciling schemas of disparate data sources: a machine-learning approach
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Learning to map between ontologies on the semantic web
Proceedings of the 11th international conference on World Wide Web
Optimization by Vector Space Methods
Optimization by Vector Space Methods
XClust: clustering XML schemas for effective integration
Proceedings of the eleventh international conference on Information and knowledge management
Global Viewing of Heterogeneous Data Sources
IEEE Transactions on Knowledge and Data Engineering
Generic Schema Matching with Cupid
Proceedings of the 27th International Conference on Very Large Data Bases
A Semantic Approach to XML-based Data Integration
ER '01 Proceedings of the 20th International Conference on Conceptual Modeling: Conceptual Modeling
Comparison of Schema Matching Evaluations
Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
On schema matching with opaque column names and data values
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Statistical schema matching across web query interfaces
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
A framework for modeling and evaluating automatic semantic reconciliation
The VLDB Journal — The International Journal on Very Large Data Bases
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Automatic ontology matching using application semantics
AI Magazine - Special issue on semantic integration
Integration of XML schemas at various "severity" levels
Information Systems
eTuner: tuning schema matching software using synthetic scenarios
The VLDB Journal — The International Journal on Very Large Data Bases
COMA: a system for flexible combination of schema matching approaches
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Instance-based schema matching for web databases by domain-specific query probing
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Bootstrapping pay-as-you-go data integration systems
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Data integration with uncertainty
The VLDB Journal — The International Journal on Very Large Data Bases
Semantic precision and recall for ontology alignment evaluation
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Tuning the ensemble selection process of schema matchers
Information Systems
Schema Matching and Mapping
AMC - A framework for modelling and comparing matching systems as matching processes
ICDE '11 Proceedings of the 2011 IEEE 27th International Conference on Data Engineering
A survey of schema-based matching approaches
Journal on Data Semantics IV
CMC: combining multiple schema-matching strategies based on credibility prediction
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Schema integration based on uncertain semantic mappings
ER'05 Proceedings of the 24th international conference on Conceptual Modeling
A Self-Configuring Schema Matching System
ICDE '12 Proceedings of the 2012 IEEE 28th International Conference on Data Engineering
Non-binary evaluation for schema matching
ER'12 Proceedings of the 31st international conference on Conceptual Modeling
Hi-index | 0.00 |
Web-scale data integration involves fully automated efforts which lack knowledge of the exact match between data descriptions. In this paper, we introduce schema matching prediction, an assessment mechanism to support schema matchers in the absence of an exact match. Given attribute pair-wise similarity measures, a predictor predicts the success of a matcher in identifying correct correspondences. We present a comprehensive framework in which predictors can be defined, designed, and evaluated. We formally define schema matching evaluation and schema matching prediction using similarity spaces and discuss a set of four desirable properties of predictors, namely correlation, robustness, tunability, and generalization. We present a method for constructing predictors, supporting generalization, and introduce prediction models as means of tuning prediction toward various quality measures. We define the empirical properties of correlation and robustness and provide concrete measures for their evaluation. We illustrate the usefulness of schema matching prediction by presenting three use cases: We propose a method for ranking the relevance of deep Web sources with respect to given user needs. We show how predictors can assist in the design of schema matching systems. Finally, we show how prediction can support dynamic weight setting of matchers in an ensemble, thus improving upon current state-of-the-art weight setting methods. An extensive empirical evaluation shows the usefulness of predictors in these use cases and demonstrates the usefulness of prediction models in increasing the performance of schema matching.