Reducing I/O Complexity by Simulating Coarse Grained Parallel Algorithms

Authors:
Frank K. H. A. Dehne;David Hutchinson;Anil Maheshwari;Wolfgang Dittrich
Affiliations:
-;-;-;-
Venue:
IPPS '99/SPDP '99 Proceedings of the 13th International Symposium on Parallel Processing and the 10th Symposium on Parallel and Distributed Processing
Year:
1999

Citing 0
Cited 11

Distribution sort with randomized cycle

SODA '01 Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms
External memory algorithms and data structures: dealing with massive data

ACM Computing Surveys (CSUR)
Parallelizing the Data Cube

Distributed and Parallel Databases - Special issue: Parallel and distributed data mining
Parallelizing the Data Cube

ICDT '01 Proceedings of the 8th International Conference on Database Theory
Seamless Integration of Parallelism and Memory Hierarchy

ICALP '02 Proceedings of the 29th International Colloquium on Automata, Languages and Programming
Duality between Prefetching and Queued Writing with Parallel Disks

ESA '01 Proceedings of the 9th Annual European Symposium on Algorithms
External memory algorithms

Handbook of massive data sets
Engineering a compact parallel delaunay algorithm in 3D

Proceedings of the twenty-second annual symposium on Computational geometry
Efficient parallel Text Retrieval techniques on Bulk Synchronous Parallel (BSP)/Coarse Grained Multicomputers (CGM)

The Journal of Supercomputing
Algorithms for memory hierarchies: advanced lectures

Algorithms for memory hierarchies: advanced lectures
Cache-oblivious simulation of parallel programs

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Block-wise access to data is a central theme in the design of efficient {\em external memory} (EM) algorithms. A second important issue, when more than one disk is present, is fully parallel disk I/O. In this paper we present a {\em deterministic simulation} technique which transforms parallel algorithms into (parallel) external memory algorithms. Specifically, we present a {\em deterministic} simulation technique which transforms {\em Coarse Grained Multicomputer} (CGM) algorithms into external memory algorithms for the {\em Parallel Disk Model}. Our technique optimizes block-wise data access and parallel disk I/O and, at the same time, utilizes {\em multiple processors} connected via a communication network or shared memory.We obtain new improved parallel external memory algorithms for a large number of problems including sorting, permutation, matrix transpose, several geometric and GIS problems including 3D convex hulls (2D Voronoi diagrams), and various graph problems. All of the (parallel) external memory algorithms obtained via simulation are analyzed with respect to the computation time, communication time and the number of I/O's. Our results answer to the challenge posed by the ACM working group on storage I/O for large-scale computing