Writing-all deterministically and optimally using a non-trivial number of asynchronous processors

Authors:
Dariusz R. Kowalski;Alexander A. Shvartsman
Affiliations:
Max-Planck-Institut für Informatik, Saarbrücken, Germany and Uniwersytet Warszawski, Warszawa, Poland;University of Connecticut, Storrs, CT and MIT CSAIL, Cambridge, MA
Venue:
Proceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures
Year:
2004

Citing 21
Cited 4

A more practical PRAM model

SPAA '89 Proceedings of the first annual ACM symposium on Parallel algorithms and architectures
The expected advantage of asynchrony

SPAA '90 Proceedings of the second annual ACM symposium on Parallel algorithms and architectures
Efficient robust parallel computations

STOC '90 Proceedings of the twenty-second annual ACM symposium on Theory of computing
Combining tentative and definite executions for very fast dependable parallel computing

STOC '91 Proceedings of the twenty-third annual ACM symposium on Theory of computing
Achieving optimal CRCW PRAM fault-tolerance

Information Processing Letters
Efficient program transformations for resilient parallel computation via randomization (preliminary version)

STOC '92 Proceedings of the twenty-fourth annual ACM symposium on Theory of computing
Work-optimal asynchronous algorithms for shared memory parallel computers

SIAM Journal on Computing
On the complexity of certified write-all algorithms

Journal of Algorithms
A Model for Asynchronous Shared Memory Parallel Computation

SIAM Journal on Computing
Time-optimal message-efficient work performance in the presence of faults

PODC '94 Proceedings of the thirteenth annual ACM symposium on Principles of distributed computing
Parallel algorithms with processor failures and delays

Journal of Algorithms
Constructions of permutation arrays for certain scheduling cost measures

Random Structures & Algorithms
Algorithms for the Certified Write-All Problem

SIAM Journal on Computing
The art of computer programming, volume 3: (2nd ed.) sorting and searching

The art of computer programming, volume 3: (2nd ed.) sorting and searching
Performing Work Efficiently in the Presence of Faults

SIAM Journal on Computing
Towards practical deteministic write-all algorithms

Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
Fault-Tolerant Parallel Computation

Fault-Tolerant Parallel Computation
Resolving message complexity of Byzantine Agreement and beyond

FOCS '95 Proceedings of the 36th Annual Symposium on Foundations of Computer Science
A work-optimal deterministic algorithm for the asynchronous certified write-all problem

Proceedings of the twenty-second annual symposium on Principles of distributed computing
Parallel processing on networks of workstations: a fault-tolerant, high performance approach

ICDCS '95 Proceedings of the 15th International Conference on Distributed Computing Systems
An algorithm for the asynchronous Write-All problem based on process collision

Distributed Computing

Cooperative asynchronous update of shared memory

Proceedings of the thirty-seventh annual ACM symposium on Theory of computing
A tight analysis and near-optimal instances of the algorithm of Anderson and Woll

Theoretical Computer Science
At-most-once semantics in asynchronous shared memory

DISC'09 Proceedings of the 23rd international conference on Distributed computing
Fast randomized test-and-set and renaming

DISC'10 Proceedings of the 24th international conference on Distributed computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The problem of performing n tasks on p asynchronous or undependable processors is a basic problem in distributed computing. This paper considers an abstraction of this problem called Write-All: using p processors write 1's into all locations of an array of size n. In this problem writing 1 abstracts the notion of performing a simple task. Despite substantial research, there is a dearth of efficient deterministic asynchronous algorithms for Write-All. Efficiency of algorithms is measured in terms of work that accounts for all local steps performed by the processors in solving the problem. Thus an optimal algorithm would have work Θ(n), however it is known that optimality cannot be achieved when p=Ω(n). The quest then is to obtain work-optimal solutions for this problem using a non-trivial, compared to n, number of processors p. Recently it was shown that optimality can be achieved using a non-trivial number M of processors, where M=4√n/log n. The new result in this paper significantly extends the range of processors for which optimality is achieved. The result shows that optimality can be achieved using close to M2 processors; more precisely, using (M log M)2-ε processors, for any ε 0. Additionally, the new result uses only the atomic read/write memory, without resorting to using the test-and-set primitive that was necessary in the previous solution. This paper presents the algorithm and gives its analysis showing that the work complexity of the algorithm is O(n+p2+ε), which is optimal when p = O(n1/(2+ε)), while all prior deterministic algorithms require super-linear work when p=Ω(n1/4).