Scaling Access to Heterogeneous Data Sources with DISCO

Authors:
Anthony Tomasic;Louiqa Raschid;Patrick Valduriez
Affiliations:
-;-;-
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
1998

Citing 37
Cited 72

A comparative analysis of methodologies for database schema integration

ACM Computing Surveys (CSUR)
Language features for interoperability of databases with schematic discrepancies

SIGMOD '91 Proceedings of the 1991 ACM SIGMOD international conference on Management of data
Classifying Schematic and Data Heterogeneity in Multidatabase Systems

Computer
The Pegasus Heterogeneous Multidatabase System

Computer
Mediators in the Architecture of Future Information Systems

Computer
Query evaluation techniques for large databases

ACM Computing Surveys (CSUR)
Tutorial notes on partial evaluation

POPL '93 Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
On resolving schematic heterogeneity in multidatabase systems

Distributed and Parallel Databases
Modern database systems: the object model, interoperability, and beyond

Modern database systems: the object model, interoperability, and beyond
Data model and query evaluation in global information systems

Journal of Intelligent Information Systems - Special issue: networked information discovery and retrieval
IRO-DB: a distributed system federating object and relational databases

Object-oriented multidatabase systems
Answering queries using views (extended abstract)

PODS '95 Proceedings of the fourteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Information translation, mediation, and mosaic-based browsing in the TSIMMIS system

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Query caching and optimization in distributed mediator systems

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Data access for the masses through OLE DB

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
The distributed information search component (Disco) and the World Wide Web

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Schema integration and query processing for multiple object databases

Integrated Computer-Aided Engineering - Special issue: multidatabase and interoperable systems
The object database standard: ODMG 2.0

The object database standard: ODMG 2.0
The Garlic project

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
InterViso: dealing with the complexity of federated database access

The VLDB Journal — The International Journal on Very Large Data Bases
APPROXIMATE: A Query Processor that Produces Monotonically Improving Approximate Answers

IEEE Transactions on Knowledge and Data Engineering
Translation of Object-Oriented Queries to Relational Queries

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Optimizing Queries Across Diverse Data Sources

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Solving Domain Mismatch and Schema Mismatch Problems with an Object-Oriented Database Programming Language

VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Query Optimization in a Heterogeneous DBMS

VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Querying Heterogeneous Information Sources Using Source Descriptions

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Object Fusion in Mediator Systems

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Calibrating the Query Optimizer Cost Model of IRO-DB, an Object-Oriented Federated Database System

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
SchemaSQL - A Language for Interoperability in Relational Multi-Database Systems

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Describing and Using Query Capabilities of Heterogeneous Sources

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
M(DM): An Open Framework for Interoperation of Multimodel Multidatabase Systems

Proceedings of the Eighth International Conference on Data Engineering
Managing Change in the Rufus System

Proceedings of the Tenth International Conference on Data Engineering
Leveraging Mediator Cost Models with Heterogeneous Data Sources

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
OASIS: An Open Architecture Scientific Information System

RIDE '96 Proceedings of the 6th International Workshop on Research Issues in Data Engineering (RIDE '96) Interoperability of Nontraditional Database Systems
Differential evaluation of continual queries

ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)
Scaling heterogeneous databases and the design of Disco

ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)

An adaptive query execution system for data integration

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A user-centered interface for querying distributed multimedia databases

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A query based approach for integrating heterogeneous data sources

Proceedings of the ninth international conference on Information and knowledge management
The state of the art in distributed query processing

ACM Computing Surveys (CSUR)
Reconciling schemas of disparate data sources: a machine-learning approach

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Answering queries with useful bindings

ACM Transactions on Database Systems (TODS)
Sharing scientific models in environmental applications

Proceedings of the 2002 ACM symposium on Applied computing
XClust: clustering XML schemas for effective integration

Proceedings of the eleventh international conference on Information and knowledge management
A WFS-based mediation system for GIS interoperability

Proceedings of the 10th ACM international symposium on Advances in geographic information systems
Design of field wrappers for mobile field data collection

Proceedings of the 10th ACM international symposium on Advances in geographic information systems
Query Decomposition for a Distributed Object-Oriented Mediator System

Distributed and Parallel Databases
Foundations of distributed interaction systems

Annals of Mathematics and Artificial Intelligence
The Ecobase project: database and web technologies for environmental information systems

ACM SIGMOD Record
An Object Algebra Approach to Multidatabase Query Decomposition in Donají

Distributed and Parallel Databases
A Unified Peer-to-Peer Database Framework for Scalable Service and Resource Discovery

GRID '02 Proceedings of the Third International Workshop on Grid Computing
Object-Oriented Mediator Queries to Internet Search Engines

OOIS '02 Proceedings of the Workshops on Advances in Object-Oriented Information Systems
Experiences with a Hybrid Implementation of a Globally Distributed Federated Database System

WAIM '01 Proceedings of the Second International Conference on Advances in Web-Age Information Management
Integrating Heterogenous Overlapping Databases through Object-Oriented Transformations

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Answering XML Queries on Heterogeneous Data Sources

Proceedings of the 27th International Conference on Very Large Data Bases
Evaluation of Join Strategies for Distributed Mediation

ADBIS '01 Proceedings of the 5th East European Conference on Advances in Databases and Information Systems
Semantic Integration and Querying of Heterogeneous Data Sources Using a Hypergraph Data Model

BNCOD 19 Proceedings of the 19th British National Conference on Databases: Advances in Databases
Query Processing in Self-Profiling Composable Peer-to-Peer Mediator Databases

EDBT '02 Proceedings of the Worshops XMLDM, MDDE, and YRWS on XML-Based Data Management and Multimedia Engineering-Revised Papers
An Architecture for Retrieval of RDF-Described Scientific Data Semantics

EDBT '02 Proceedings of the Worshops XMLDM, MDDE, and YRWS on XML-Based Data Management and Multimedia Engineering-Revised Papers
Data Integration under Integrity Constraints

CAiSE '02 Proceedings of the 14th International Conference on Advanced Information Systems Engineering
Exploiting and Completing Web Data Sources Capabilities

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
Data Integration Using Web Services

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
Integrating GIS and Imagery Through XML-Based Information Mediation

ISD '99 Selected Papers from the International Workshop on Integrated Spatial Databases, Digital Inages and GIS
Experiences in Federated Databases: From IRO-DB to MIRO-Web

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Combining Mediator and Data Warehouse Technologies for Developing Environmental Decision Support Systems

GIScience '02 Proceedings of the Second International Conference on Geographic Information Science
ObjectGlobe: Ubiquitous query processing on the Internet

The VLDB Journal — The International Journal on Very Large Data Bases
A User Interface for Distributed Multimedia Database Querying with Mediator Supported Refinement

IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
The biological integration system

WIDM '03 Proceedings of the 5th ACM international workshop on Web information and data management
The virGIS WFS-based spatial mediation system

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Optimizing Recursive Information Gathering Plans in EMERAC

Journal of Intelligent Information Systems
Data integration under integrity constraints

Information Systems - Special issue: The 14th international conference on advanced information systems engineering (CAiSE*02)
Bringing together content and data management systems: Challenges and opportunities

IBM Systems Journal
A mediation framework for multimedia delivery

Proceedings of the 3rd international conference on Mobile and ubiquitous multimedia
Load and Network Aware Query Routing for Information Integration

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
SKIMA: Semantic Knowledge and Information Management

ENC '05 Proceedings of the Sixth Mexican International Conference on Computer Science
Information mediation across heterogeneous government spatial data sources

dg.o '02 Proceedings of the 2002 annual national conference on Digital government research
Achieving Communication Efficiency through Push-Pull Partitioning of Semantic Spaces to Disseminate Dynamic Information

IEEE Transactions on Knowledge and Data Engineering
Policy-based security management for federated healthcare databases (or RHIOs)

HIKM '06 Proceedings of the international workshop on Healthcare information and knowledge management
Extracting knowledge from XML document repository: a semantic Web-based approach

Information Technology and Management
Query optimization via contention space partitioning and cost error controlling for dynamic multidatabase systems

Distributed and Parallel Databases
Dynamic adaptation of multi-key index for distributed database system

ICCOMP'05 Proceedings of the 9th WSEAS International Conference on Computers
Hybrid query processing through services composition

Ph.D. '08 Proceedings of the 2008 EDBT Ph.D. workshop
DObjects: Enabling Distributed Data Services for Metacomputing Platforms

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
Dynamic Source Selection in Large Scale Mediation Systems

Globe '08 Proceedings of the 1st international conference on Data Management in Grid and Peer-to-Peer Systems
Data fusion

ACM Computing Surveys (CSUR)
Scalable architecture for web service discovery

Proceedings of the 3rd international conference on Scalable information systems
Data Integration through ${\textit{DL-Lite}_{\mathcal A}}$ Ontologies

Semantics in Data and Knowledge Bases
Extending SOA with Semantic Mediators

Advanced Internet Based Systems and Applications
A self-adaptable query allocation framework for distributed information systems

The VLDB Journal — The International Journal on Very Large Data Bases
Conceptual Modeling for Data Integration

Conceptual Modeling: Foundations and Applications
Evolution of Query Optimization Methods: From Centralized Database Systems to Data Grid Systems

DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Modeling of secure data extraction in ETL processes using UML 2.0

AsiaMS '07 Proceedings of the IASTED Asian Conference on Modelling and Simulation
An efficient query evaluation in a mediator based on implementation plan

Information Sciences: an International Journal
A practical approach to extracting DTD-conforming XML documents from heterogeneous data sources

Information Sciences: an International Journal
Quete: ontology-based query system for distributed sources

ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Combining artificial intelligence and databases for data integration

Artificial intelligence today
Indexing source descriptions based on defined classes

Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Source selection in large scale data contexts: an optimization approach

DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Ad-hoc distributed spatial joins on mobile devices

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Catalogue manager for metadata dissemination in the NetTraveler middleware system

International Journal of Intelligent Information and Database Systems
Improving source selection in large scale mediation systems through combinatorial optimization techniques

Transactions on large-scale data- and knowledge-centered systems III
Principles of distributed data management in 2020?

DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Algorithms and software for collaborative discovery from autonomous, semantically heterogeneous, distributed information sources

ALT'05 Proceedings of the 16th international conference on Algorithmic Learning Theory
Data management in large-scale p2p systems

VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Distributed processing of large biomedical 3d images

VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Performance-oriented privacy-preserving data integration

DILS'05 Proceedings of the Second international conference on Data Integration in the Life Sciences
Query translation for distributed heterogeneous structured and semi-structured databases

BNCOD'06 Proceedings of the 23rd British National Conference on Databases, conference on Flexible and Efficient Information Handling
MDSSF: a federated architecture for product procurement

DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Accessing many data sources aggravates problems for users of heterogeneous distributed databases. Database administrators must deal with fragile mediators, that is, mediators with schemas and views that must be significantly changed to incorporate a new data source. When implementing translators of queries from mediators to data sources, database implementors must deal with data sources that do not support all the functionality required by mediators. Application programmers must deal with graceless failures for unavailable data sources. Queries simply return failure and no further information when data sources are unavailable for query processing. The Distributed Information Search COmponent (DISCO) addresses these problems. Data modeling techniques manage the connections to data sources, and sources can be added transparently to the users and applications. The interface between mediators and data sources flexibly handles different query languages and different data source functionality. Query rewriting and optimization techniques rewrite queries so they are efficiently evaluated by sources. Query processing and evaluation semantics are developed to process queries over unavailable data sources. In this article, we describe 1) the distributed mediator architecture of DISCO; 2) the data model and its modeling of data source connections; 3) the interface to underlying data sources and the query rewriting process; and 4) query processing semantics. We describe several advantages of our system.