Approximate query mapping: Accounting for translation closeness

Authors:
Kevin Chen-Chuan Chang;Hé/ctor Garcí/a-Molina
Affiliations:
Computer Science Department, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA/ e-mail: kcchang@cs.uiuc.edu;Computer Science Department, Stanford University, Stanford, CA 94305, USA/ E-mail: hector@db.stanford.edu
Venue:
The VLDB Journal — The International Journal on Very Large Data Bases
Year:
2001

Citing 35
Cited 10

Automatic text processing

Automatic text processing
A semantics for complex objects and approximate answers

Journal of Computer and System Sciences
Mediators in the Architecture of Future Information Systems

Computer
Finding nonrecursive envelopes for Datalog predicate

PODS '93 Proceedings of the twelfth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
The design and implementation of CoBase

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Using semantic values to facilitate interoperability among heterogeneous information systems

ACM Transactions on Database Systems (TODS)
Can Datalog be approximated?

PODS '94 Proceedings of the thirteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Answering queries using templates with binding patterns (extended abstract)

PODS '95 Proceedings of the fourteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Answering queries using limited external query processors (extended abstract)

PODS '96 Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
On saying “Enough already!” in SQL

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Infomaster: an information integration system

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Managing semantic heterogeneity in databases: a theoretical prospective

PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Your mediators need data conversion!

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Conjunctive constraint mapping for data translation

Proceedings of the third ACM conference on Digital libraries
Mind your vocabulary: query mapping across heterogeneous information sources

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Information Retrieval

Information Retrieval
The Design and Analysis of Computer Algorithms

The Design and Analysis of Computer Algorithms
Resolving Database Incompatibility: An Approach to Performing Relational Operations over Mismatched Domains

IEEE Transactions on Knowledge and Data Engineering
Information Integration

IEEE Intelligent Systems
A Query Translation Scheme for Rapid Implementation of Wrappers

DOOD '95 Proceedings of the Fourth International Conference on Deductive and Object-Oriented Databases
MedMaker: A Mediation System Based on Declarative Specifications

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Correspondence and Translation for Heterogeneous Data

ICDT '97 Proceedings of the 6th International Conference on Database Theory
Information Integration Using Logical Views

ICDT '97 Proceedings of the 6th International Conference on Database Theory
Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Optimizing Queries Across Diverse Data Sources

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Reducing the Braking Distance of an SQL Query Engine

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Filtering with Approximate Predicates

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
A Data Transformation System for Biological Data Sources

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Querying Heterogeneous Information Sources Using Source Descriptions

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Fast Approximate Query Answering Using Precomputed Statistics

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
On Getting Some Answers Quickly, and Perhaps More Later

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Capability-Sensitive Query Processing on Internet Sources

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Query planning and optimization in information integration

Query planning and optimization in information integration
Query and data mapping across heterogeneous information sources

Query and data mapping across heterogeneous information sources
Query-answering algorithms for information agents

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1

Approximate Information Filtering on the Semantic Web

KI '02 Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence
Exploiting Partially Shared Ontologies for Multi-agent Communication

CIA '02 Proceedings of the 6th International Workshop on Cooperative Information Agents VI
Acquisition of Soft Taxonomies for Intelligent Personal Hierarchies and the Soft Semantic Web

BT Technology Journal
Mediators over taxonomy-based information sources

The VLDB Journal — The International Journal on Very Large Data Bases
Light-weight domain-based form assistant: querying web databases on the fly

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Approximate information retrieval based on multielement bounds

Knowledge-Based Systems
Fuzzy sets in the fight against digital obesity

Fuzzy Sets and Systems
Refined approximation of concepts in ontology

AI*IA'05 Proceedings of the 9th conference on Advances in Artificial Intelligence
Approximations of concept based on multielement bounds

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
Improving access to multimedia using multi-source hierarchical meta-data

AMR'05 Proceedings of the Third international conference on Adaptive Multimedia Retrieval: user, context, and feedback

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present a mechanism for approximately translating Boolean query constraints across heterogeneous information sources. Achieving the best translation is challenging because sources support different constraints for formulating queries, and often these constraints cannot be precisely translated. For instance, a query [score8] might be “perfectly” translated as [rating0.8] at some site, but can only be approximated as [grade=A] at another. Unlike other work, our general framework adopts a customizable “closeness” metric for the translation that combines both precision and recall. Our results show that for query translation we need to handle interdependencies among both query conjuncts as well as disjuncts. As the basis, we identify the essential requirements of a rule system for users to encode the mappings for atomic semantic units. Our algorithm then translates complex queries by rewriting them in terms of the semantic units. We show that, under practical assumptions, our algorithm generates the best approximate translations with respect to the closeness metric of choice. We also present a case study to show how our technique may be applied in practice.