The input/output complexity of sorting and related problems
Communications of the ACM
A case for redundant arrays of inexpensive disks (RAID)
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Large-scale sorting in parallel memories (extended abstract)
SPAA '91 Proceedings of the third annual ACM symposium on Parallel algorithms and architectures
Average-case analysis of algorithms and data structures
Handbook of theoretical computer science (vol. A)
Deterministic distribution sort in shared and distributed memory multiprocessors
SPAA '93 Proceedings of the fifth annual ACM symposium on Parallel algorithms and architectures
Markov analysis of multiple-disk prefetching strategies for external merging
Theoretical Computer Science - Special issue on dependable parallel computing
The art of computer programming, volume 3: (2nd ed.) sorting and searching
The art of computer programming, volume 3: (2nd ed.) sorting and searching
Optimal parallel sorting in multi-level storage
SODA '94 Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms
Communications of the ACM
I/O complexity: The red-blue pebble game
STOC '81 Proceedings of the thirteenth annual ACM symposium on Theory of computing
Massive Information Storage, Management, and Use
Massive Information Storage, Management, and Use
A simple and efficient parallel disk mergesort
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
LEDA-SM Extending LEDA to Secondary Memory
WAE '99 Proceedings of the 3rd International Workshop on Algorithm Engineering
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Hi-index | 0.00 |
We consider the problem of sorting a file of N records on theD-disk model of parallel I/0 [VS94] in which there are two sourcesof parallehsm. Records are transferred to and from diskconcurrently in blocks of B con-tiguous records. In each I/Ooperation, up to one block can be transferred to or from each ofthe D disks in parallel. We propose a simple, eficient, randomizedmergesort algorithm called SRM that uses a forecast-and-flushapproach to overcome the inherent difficulties of simple merging onparallel disks. SRM exhibits a limited use of randomization andalso has a useful deterministic version. Generalizing theforecasting technique of [Knu73], our algorithm, is able to readin, at any time, the right block from any disk, and using thetechnique of flushing, our algorithm evicts, without any I/0overhead, just the right blocks from memory to make space for newones to be read in. The disk layout of SRM is such that it enjoysperfect write parallelism, avoiding fundamental inefficiencies ofprevious mergesort algorithms. Our analysis technique involves anovel reduction to various maximum occupancy problems. We provethat the expected I/O performance of SRM is efficient under varyingsizes of memory and that it compares favorably in practice todisk-striped mergesort (DSM). Our studies indicate that SRMoutperforms DSM even when the number D of parallel disks is fairlysmall.