Computation migration: enhancing locality for distributed-memory parallel systems

Authors:
Wilson C. Hsieh; Paul Wang; William E. Weihl
Venue:
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Year:
1993


Abstract

We describe computation migration, a new technique, based on compile-time program transformations, for accessing remote data in a distributed-memory parallel system. In contrast with RPC-style access, where the access is performed remotely, and with data migration, where the data is moved so that it is local, computation migration moves part of the current thread to the processor where the data resides. The access is performed at the remote processor, and the migrated thread portion continues to run on that same processor; this makes subsequent accesses in the thread portion local.

We describe an implementation of computation migration that consists of two parts: an implementation that migrates single activation frames, and a high-level language annotation that allows a programmer to express when migration is desired. We performed experiments using two applications; these experiments demonstrate that computation migration is a valuable alternative to RPC and data migration.
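The contrast among the three access styles can be illustrated with a toy message-count model. The sketch below is not the paper's implementation and does not reproduce its experiments. The function names, the per-access message costs (two messages for an RPC or a data fetch, one for moving an activation frame), and the object placement are all assumptions chosen for illustration. The model ignores message sizes, contention, and coherence traffic.

```python
# Toy model (illustrative assumptions only): count the messages a thread
# sends while accessing objects that live on various processors.
# `owner` maps an object name to the processor holding it;
# `accesses` is the sequence of objects the thread touches.

def rpc_messages(home, owner, accesses):
    """RPC style: the thread stays on `home`; every remote access
    costs a request plus a reply (assumed cost: 2 messages)."""
    return sum(2 for obj in accesses if owner[obj] != home)


def data_migration_messages(home, owner, accesses):
    """Data migration: the first touch of a remote object moves it to
    `home` (assumed cost: request + transfer); later touches are local."""
    location = dict(owner)  # copy so the caller's placement is untouched
    messages = 0
    for obj in accesses:
        if location[obj] != home:
            messages += 2
            location[obj] = home
    return messages


def computation_migration_messages(home, owner, accesses):
    """Computation migration: the thread portion moves to the data
    (assumed cost: 1 message per hop) and keeps running there, so
    subsequent accesses on that processor are local."""
    current = home
    messages = 0
    for obj in accesses:
        if owner[obj] != current:
            messages += 1
            current = owner[obj]
    if current != home:
        messages += 1  # send the result back to the originating processor
    return messages


if __name__ == "__main__":
    owner = {"a": 1, "b": 1, "c": 2}        # hypothetical placement
    accesses = ["a", "b", "a", "b", "c"]    # hypothetical access pattern
    for fn in (rpc_messages, data_migration_messages,
               computation_migration_messages):
        print(fn.__name__, fn(0, owner, accesses))
```

Under these assumptions, a run of consecutive accesses to data on one remote processor costs a single hop with computation migration, while RPC pays for every access. Data migration falls in between when objects are reused, and a real system would also have to pay to keep moved or replicated data coherent.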