Join processing in database systems with large main memories
ACM Transactions on Database Systems (TODS)
Optimization of large join queries: combining heuristics and combinatorial techniques
SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
An adaptive hash join algorithm for multiuser environments
Proceedings of the sixteenth international conference on Very large databases
Tradeoffs in processing complex join queries via hashing in multiprocessor database machines
Proceedings of the sixteenth international conference on Very large databases
Multi-way equijoin query acceleration using hit-lists
Multi-way equijoin query acceleration using hit-lists
Accurate modeling of the hybrid hash join algorithm
SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Principles of distributed database systems (2nd ed.)
Principles of distributed database systems (2nd ed.)
Readings in database systems (3rd ed.)
Readings in database systems (3rd ed.)
Join and Semijoin Algorithms for a Multiprocessor Database Machine
ACM Transactions on Database Systems (TODS)
The state of the art in distributed query processing
ACM Computing Surveys (CSUR)
Dynamic memory allocation strategies for parallel query execution
Proceedings of the 2002 ACM symposium on Applied computing
Domain Vector Accelerator for Relational Operations
Proceedings of the Seventh International Conference on Data Engineering
Multi-Join Optimization for Symmetric Multiprocessors
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Integrating Semi-Join-Reducers into State of the Art Query Processors
Proceedings of the 17th International Conference on Data Engineering
Performance Modeling of the Grace Hash Join on Cluster Architectures
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Mobile Agent Cooperation Methods for Large Scale Distributed Dynamic Query Optimization
DEXA '03 Proceedings of the 14th International Workshop on Database and Expert Systems Applications
Improving Hash Join Performance through Prefetching
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Algebraic Signatures for Scalable Distributed Data Structures
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Resource Scheduling for Parallel Query Processing on Computational Grids
GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
Revisiting pipelined parallelism in multi-join query processing
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Distributed Evaluation of Continuous Equi-join Queries over Large Structured Overlay Networks
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
A Cost-Space Approach to Distributed Query Optimization in Stream Based Overlays
ICDEW '05 Proceedings of the 21st International Conference on Data Engineering Workshops
Locality Aware Networked Join Evaluation
ICDEW '05 Proceedings of the 21st International Conference on Data Engineering Workshops
String-Matching and Update through Algebraic Signatures in Scalable Distributed Data Structures
DEXA '06 Proceedings of the 17th International Conference on Database and Expert Systems Applications
Improving hash join performance through prefetching
ACM Transactions on Database Systems (TODS)
Hardware acceleration in commercial databases: a case study of spatial operations
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Fast nGram-based string search over data encoded using algebraic signatures
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Evolution of Query Optimization Methods
Transactions on Large-Scale Data- and Knowledge-Centered Systems I
Hi-index | 0.00 |
Evaluation of distributed join queries in large scale environments often deals with large data volumes, network with low bandwidth and high number of data sources. Several techniques were proposed to improve join query performances. We propose to combine algebraic signatures with known hash join techniques. Algebraic signatures combined to a semi join based join technique are also very useful for reducing the volume of inter-site transmitted data. We discuss impacts of CPU, I/O and communication costs reductions on the total multi-join query performance improvements. The performance study of our technique shows the reduction of both memory requirements and processing costs especially when the join attribute is a long string. Performances of the semi join based join technique are also significantly improved. We experiment with simple, medium and complex queries in both centralised and distributed environments. Algebraic signatures provide improved performances to reduce significantly communication costs, often the dominant factor in distributed environments.