Optimizing chain queries in a distributed database system.
SIAM Journal on Computing
Optimization of distributed tree queries
Journal of Computer and System Sciences
An Optimal Algorithm for Processing Distributed Star Queries
IEEE Transactions on Software Engineering
Optimization of join operations in horizontally partitioned database systems
ACM Transactions on Database Systems (TODS)
Fragmentation: a technique for efficient query processing
ACM Transactions on Database Systems (TODS)
Join processing in database systems with large main memories
ACM Transactions on Database Systems (TODS)
Set query optimization in distributed database systems
ACM Transactions on Database Systems (TODS)
Algorithms to Process Distributed Queries in Fast Local Networks
IEEE Transactions on Computers
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Optimizing Join Queries in Distributed Databases
IEEE Transactions on Software Engineering
Fast parallel algorithms for processing of joins
Proceedings of the 1st International Conference on Supercomputing
On the effect of join operations on relation sizes
ACM Transactions on Database Systems (TODS)
Query size estimation by adaptive sampling (extended abstract)
PODS '90 Proceedings of the ninth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Join and Semijoin Algorithms for a Multiprocessor Database Machine
ACM Transactions on Database Systems (TODS)
Query processing in a system for distributed databases (SDD-1)
ACM Transactions on Database Systems (TODS)
Tree queries: a simple class of relational queries
ACM Transactions on Database Systems (TODS)
Parallel algorithms for the execution of relational database operations
ACM Transactions on Database Systems (TODS)
Implementing a relational database by means of specialzed hardware
ACM Transactions on Database Systems (TODS)
Extending the database relational model to capture more meaning
ACM Transactions on Database Systems (TODS)
Computers and Intractability: A Guide to the Theory of NP-Completeness
Computers and Intractability: A Guide to the Theory of NP-Completeness
Access path selection in a relational database management system
SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
A methodology for interpreting tree queries into optimal semi-join expressions
SIGMOD '80 Proceedings of the 1980 ACM SIGMOD international conference on Management of data
A new way to compute the product and join of relations
SIGMOD '80 Proceedings of the 1980 ACM SIGMOD international conference on Management of data
Relational Algebra Machine GRACE
Proceedings of RIMS Symposium on Software Science and Engineering
GAMMA - A High Performance Dataflow Database Machine
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
A new approach to modeling office documents
ACM SIGOIS Bulletin - Special issue: “Do users get what they want?” (DUG'93)
Solving satisfiability and implication problems in database systems
ACM Transactions on Database Systems (TODS)
Adaptive Algorithms for Join Processing in Distributed Database Systems
Distributed and Parallel Databases
A heuristic algorithm for partition strategy in distributed query processing
SAC '96 Proceedings of the 1996 ACM symposium on Applied Computing
On the Complexity of Distributed Query Optimization
IEEE Transactions on Knowledge and Data Engineering
Criss-Cross Hash Joins: Design and Analysis
IEEE Transactions on Knowledge and Data Engineering
Coloring Away Communication in Parallel Query Optimization
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Experimental evidence on partitioning in parallel data warehouses
Proceedings of the 7th ACM international workshop on Data warehousing and OLAP
Efficiently Processing Query-Intensive Databases over a Non-Dedicated Local Network
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
On the size of the search space of join optimization
Journal of Computing Sciences in Colleges
Information Sciences: an International Journal
A systematic approach to the integration of overlapping partitions in service-oriented data grids
Future Generation Computer Systems
PMJoin: optimizing distributed multi-way stream joins by stream partitioning
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Hierarchical aggregation in networked data management
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Large relations in node-partitioned data warehouses
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Incorporating processor costs in optimizing the distributed execution of join queries
Mathematical and Computer Modelling: An International Journal
The propagation of updates to relational tables in a distributed database system
Mathematical and Computer Modelling: An International Journal
Allocating relations in a distributed database system
Mathematical and Computer Modelling: An International Journal
Evaluating multiple join queries in a distributed database system
Mathematical and Computer Modelling: An International Journal
Mathematical and Computer Modelling: An International Journal
Executing join queries in an uncertain distributed environment
Mathematical and Computer Modelling: An International Journal
TEEPA: a timely-aware elastic parallel architecture
Proceedings of the 16th International Database Engineering & Applications Sysmposium
Overcoming the scalability limitations of parallel star schema data warehouses
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Providing timely results with an elastic parallel DW
ISMIS'12 Proceedings of the 20th international conference on Foundations of Intelligent Systems
Hi-index | 0.00 |
Consider the class of distributed database systems consisting of a set of nodes connected by a high bandwidth network. Each node consists of a processor, a random access memory, and a slower but much larger memory such as a disk. There is no shared memory among the nodes. The data are horizontally partitioned often using a hash function. Such a description characterizes many parallel or distributed database systems that have recently been proposed, both commercial and academic. We study the optimization problem that arises when the query processor must repartition the relations and intermediate results participating in a multijoin query. Using estimates of the sizes of intermediate relations, we show (1) optimum solutions for closed chain queries; (2) the NP-completeness of the optimization problem for star, tree, and general graph queries; and (3) effective heuristics for these hard cases.Our general approach and many of our results extend to other attribute partitioning schemes, for example, sort-partitioning on attributes, and to partitioned object databases.