Allocating Data and Operations to Nodes in Distributed Database Design

Authors:
Salvatore T. March;Sangkyu Rho
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
1995

Abstract

The allocation of data and operations to nodes in a computer communications network is a critical issue in distributed database design. An efficient distributed database design must trade off performance and cost among retrieval and update activities at the various nodes. It must consider the concurrency control mechanism used as well as capacity constraints at nodes and on links in the network. It must determine where data will be allocated, the degree of data replication, which copy of the data will be used for each retrieval activity, and where operations such as select, project, join, and union will be performed. We develop a comprehensive mathematical modeling approach for this problem. The approach first generates units of data (file fragments) to be allocated from a logical data model representation and a characterization of retrieval and update activities. Retrieval and update activities are then decomposed into relational operations on these fragments. Both fragments and operations on them are then allocated to nodes using a mathematical modeling approach. The mathematical model considers network communication, local processing, and data storage costs. A genetic algorithm is developed to solve this mathematical formulation.
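To make the shape of the problem concrete, the following is a minimal sketch of a genetic algorithm searching over replicated fragment allocations. It is not the paper's formulation: the three-node network, the cost figures, the request frequencies, and every function name are hypothetical, and the toy cost function covers only storage and communication. It leaves out local processing costs, operation allocation (select, project, join, union), concurrency control, and capacity constraints, all of which the abstract says the full model considers.

```python
import random

# Hypothetical toy instance; every number below is an illustrative assumption.
NODES = 3
FRAGMENTS = 4
# LINK[a][b]: cost of shipping one unit of data between node a and node b.
LINK = [[0, 4, 6],
        [4, 0, 3],
        [6, 3, 0]]
STORAGE = 2  # cost of keeping one copy of a fragment at a node
# RETRIEVALS[n][f] / UPDATES[n][f]: request frequency at node n for fragment f.
RETRIEVALS = [[9, 1, 0, 2], [1, 8, 3, 0], [0, 2, 7, 5]]
UPDATES = [[2, 0, 0, 1], [0, 3, 1, 0], [0, 0, 2, 4]]


def repair(alloc):
    """Give any fragment that has no copy a single copy at node 0."""
    for row in alloc:
        if not any(row):
            row[0] = 1
    return alloc


def cost(alloc):
    """Storage cost, plus retrievals served by the cheapest copy,
    plus updates propagated to every copy. alloc[f][n] == 1 means
    fragment f is stored at node n."""
    total = 0
    for f in range(FRAGMENTS):
        holders = [n for n in range(NODES) if alloc[f][n]]
        total += STORAGE * len(holders)
        for n in range(NODES):
            total += RETRIEVALS[n][f] * min(LINK[h][n] for h in holders)
            total += UPDATES[n][f] * sum(LINK[n][h] for h in holders)
    return total


def random_alloc(rng):
    rows = [[rng.randint(0, 1) for _ in range(NODES)] for _ in range(FRAGMENTS)]
    return repair(rows)


def crossover(a, b, rng):
    """Uniform crossover: each fragment's placement row comes from one parent."""
    return [list(rng.choice((row_a, row_b))) for row_a, row_b in zip(a, b)]


def mutate(alloc, rng, rate=0.1):
    """Flip each placement bit with probability `rate`, then repair."""
    for row in alloc:
        for n in range(NODES):
            if rng.random() < rate:
                row[n] ^= 1
    return repair(alloc)


def genetic_search(generations=60, pop_size=30, seed=0):
    rng = random.Random(seed)
    pop = [random_alloc(rng) for _ in range(pop_size)]
    best = min(pop, key=cost)
    for _ in range(generations):
        def pick():
            # Tournament selection: cheapest of three random individuals.
            return min(rng.sample(pop, 3), key=cost)
        children = [mutate(crossover(pick(), pick(), rng), rng)
                    for _ in range(pop_size - 1)]
        children.append([row[:] for row in best])  # elitism keeps the best so far
        pop = children
        best = min(pop, key=cost)
    return best, cost(best)


if __name__ == "__main__":
    allocation, total_cost = genetic_search()
    print(allocation, total_cost)
```

Each chromosome is a fragment-by-node bit matrix, so replication falls out of the encoding: a row with several ones is a replicated fragment. The trade-off the abstract describes shows up directly in `cost`, where extra copies make retrievals cheaper and updates more expensive.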