Managing uncertainty in schema matching with top-k schema mappings

Authors:
Avigdor Gal
Affiliations:
Technion – Israel Institute of Technology, Haifa, Israel
Venue:
Journal on Data Semantics VI
Year:
2006

Citing 22
Cited 26

Efficient algorithms for finding maximum matching in graphs

ACM Computing Surveys (CSUR)
Federated database systems for managing distributed, heterogeneous, and autonomous databases

ACM Computing Surveys (CSUR) - Special issue on heterogeneous databases
Managing semantic heterogeneity in databases: a theoretical prospective

PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Conceptual schema analysis: techniques and applications

ACM Transactions on Database Systems (TODS)
LEDA: a platform for combinatorial and geometric computing

LEDA: a platform for combinatorial and geometric computing
Semantic integration of heterogeneous information sources

Data & Knowledge Engineering - Special issue on heterogeneous information resources need semantic access
The Clio project: managing heterogeneity

ACM SIGMOD Record
Reconciling schemas of disparate data sources: a machine-learning approach

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Learning to map between ontologies on the semantic web

Proceedings of the 11th international conference on World Wide Web
Data modelling versus ontology engineering

ACM SIGMOD Record
Schema Mapping as Query Discovery

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Optimizing Multi-Feature Queries for Image Databases

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
The Grand Challenge in Information Technology and the Illusion of Validity

CAiSE '02 Proceedings of the 14th International Conference on Advanced Information Systems Engineering
The Use of Machine-Generated Ontologies in Dynamic Information Seeking

CooplS '01 Proceedings of the 9th International Conference on Cooperative Information Systems
Formal Ontology Engineering in the DOGMA Approach

On the Move to Meaningful Internet Systems, 2002 - DOA/CoopIS/ODBASE 2002 Confederated International Conferences DOA, CoopIS and ODBASE 2002
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
A survey of approaches to automatic schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
Representing and reasoning about mappings between domain models

Eighteenth national conference on Artificial intelligence
Rondo: a programming platform for generic model management

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
A framework for modeling and evaluating automatic semantic reconciliation

The VLDB Journal — The International Journal on Very Large Data Bases
Automatic ontology matching using application semantics

AI Magazine - Special issue on semantic integration
COMA: a system for flexible combination of schema matching approaches

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

Why is schema matching tough and what can we do about it?

ACM SIGMOD Record
Model management 2.0: manipulating richer mappings

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Rank Aggregation for Automatic Schema Matching

IEEE Transactions on Knowledge and Data Engineering
Data integration with uncertainty

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Schema mapping verification: the spicy way

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Matching large ontologies: A divide-and-conquer approach

Data & Knowledge Engineering
Managing Uncertainty in Schema Matcher Ensembles

SUM '07 Proceedings of the 1st international conference on Scalable Uncertainty Management
Preference-Based Uncertain Data Integration

EKAW '08 Proceedings of the 16th international conference on Knowledge Engineering: Practice and Patterns
Providing Top-K Alternative Schema Matchings with ${\mathcal{O}}nto {\mathcal{M}}atcher$

ER '08 Proceedings of the 27th International Conference on Conceptual Modeling
Ten Challenges for Ontology Matching

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems
Advances in Ontology Matching

Advances in Web Semantics I
Improving XML schema matching performance using Prüfer sequences

Data & Knowledge Engineering
Top-k generation of integrated schemas based on directed and weighted correspondences

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
A Prioritized Collective Selection Strategy for Schema Matching across Query Interfaces

BNCOD 26 Proceedings of the 26th British National Conference on Databases: Dataspace: The Final Frontier
Tuning the ensemble selection process of schema matchers

Information Systems
PruSM: a prudent schema matching approach for web forms

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Combining schema and level-based matching for web service discovery

ICWE'10 Proceedings of the 10th international conference on Web engineering
Restricting the overlap of Top-N sets in schema matching

Proceedings of the 1st Workshop on New Trends in Similarity Search
Discovery of probabilistic mappings between taxonomies: principles and experiments

Journal on data semantics XV
A clustering-based approach for large-scale ontology matching

ADBIS'11 Proceedings of the 15th international conference on Advances in databases and information systems
Coherent top-k ontology alignment for OWL EL

SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
OntoBuilder: fully automatic extraction and consolidation of ontologies from web sources using sequence semantics

EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Efficient management of uncertainty in XML schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
Making sense of top-k matchings: a unified match graph for schema matching

Proceedings of the Ninth International Workshop on Information Integration on the Web
Schema matching and embedded value mapping for databases with opaque column names and mixed continuous and discrete-valued data fields

ACM Transactions on Database Systems (TODS)
Reducing uncertainty of schema matching via crowdsourcing

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose to extend current practice in schema matching with the simultaneous use of top-K schema mappings rather than a single best mapping. This is a natural extension of existing methods (which can be considered to fall into the top-1 category), taking into account the imprecision inherent in the schema matching process. The essence of this method is the simultaneous generation and examination of K best schema mappings to identify useful mappings. The paper discusses efficient methods for generating top-K methods and propose a generic methodology for the simultaneous utilization of top-K mappings. We also propose a concrete heuristic that aims at improving precision at the cost of recall. We have tested the heuristic on real as well as synthetic data and anlyze the emricial results. The novelty of this paper lies in the robust extension of existing methods for schema matching, one that can gracefully accommodate less-than-perfect scenarios in which the exact mapping cannot be identified in a single iteration. Our proposal represents a step forward in achieving fully automated schema matching, which is currently semi-automated at best.