Fine-grained mobility in the Emerald system
ACM Transactions on Computer Systems (TOCS)
Dynamic load balancing for distributed memory multiprocessors
Journal of Parallel and Distributed Computing
The Amber system: parallel programming on a network of multiprocessors
SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
Chare kernel—a runtime support system for parallel computations
Journal of Parallel and Distributed Computing
Orca: A Language for Parallel Programming of Distributed Systems
IEEE Transactions on Software Engineering
Active messages: a mechanism for integrated communication and computation
ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
CHARM++: a portable concurrent object oriented system based on C++
OOPSLA '93 Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications
Parallel programming in Split-C
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Parallel load-balancing: an extension to the gradient model
Parallel Computing
Journal of Parallel and Distributed Computing
Cilk: an efficient multithreaded runtime system
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
CRL: high-performance all-software distributed shared memory
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
MILLIPEDE: easy parallel programming in available distributed environments
Software—Practice & Experience
Parallel dynamic graph partitioning for adaptive unstructured meshes
Journal of Parallel and Distributed Computing - Special issue on dynamic load balancing
PLUM: parallel load balancing for adaptive unstructured meshes
Journal of Parallel and Distributed Computing
Design of dynamic load-balancing tools for parallel applications
Proceedings of the 14th international conference on Supercomputing
Mobile object layer: a runtime substrate for parallel adaptive and irregular computations
Advances in Engineering Software - Special issue on large-scale analysis, design and intelligent synthesis environments
virtual data space—load balancing for irregular applications
Parallel Computing - special issue on parallel computing for irregular applications
Parallel software abstractions for structured adaptive mesh methods
Journal of Parallel and Distributed Computing
MPI: The Complete Reference
Proceedings of the 18th annual international conference on Supercomputing
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Practical Performance Model for Optimizing Dynamic Load Balancing of Adaptive Applications
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Overhead Analysis of a Dynamic Load Balancing Library for Cluster Computing
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 1 - Volume 02
Proceedings of the 19th annual international conference on Supercomputing
Design and implementation of a novel dynamic load balancing library for cluster computing
Parallel Computing - Heterogeneous computing
An architectural style for high-performance asymmetrical parallel computations
Proceedings of the 28th international conference on Software engineering
Toward real-time image guided neurosurgery using distributed and grid computing
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Parallel unstructured mesh generation by an advancing front method
Mathematics and Computers in Simulation
Algorithm 870: A static geometric Medial Axis domain decomposition in 2D Euclidean space
ACM Transactions on Mathematical Software (TOMS)
Algorithm 872: Parallel 2D constrained Delaunay mesh generation
ACM Transactions on Mathematical Software (TOMS)
Performance evaluation of a dynamic load-balancing library for cluster computing
International Journal of Computational Science and Engineering
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Controlling the Behaviour of Database Servers with 2PAC and DiffServ
DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
A multigrain Delaunay mesh generation method for multicore SMT-based architectures
Journal of Parallel and Distributed Computing
A template for developing next generation parallel Delaunay refinement methods
Finite Elements in Analysis and Design
A Space-Based Generic Pattern for Self-Initiative Load Balancing Agents
ESAW '09 Proceedings of the 10th International Workshop on Engineering Societies in the Agents World X
Task assignment on parallel QoS systems
WISE'07 Proceedings of the 8th international conference on Web information systems engineering
A delay-based dynamic load balancing method and its stability analysis and simulation
EuroPar'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part I
Effective out-of-core parallel delaunay mesh refinement using off-the-shelf software
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Effective out-of-core parallel delaunay mesh refinement using off-the-shelf software
Journal of Experimental Algorithmics (JEA)
Adaptive agent allocation for massively multi-agent applications
MMAS'04 Proceedings of the First international conference on Massively Multi-Agent Systems
Hi-index | 0.00 |
Abstract--This paper describes the design of a flexible load balancing framework and runtime software system for supporting the development of adaptive applications on distributed-memory parallel computers. The runtime system supports a global namespace, transparent object migration, automatic message forwarding and routing, and automatic load balancing. These features can be used at the discretion of the application developer in order to simplify program development and to eliminate complex bookkeeping associated with mobile data objects. An evaluation of this system in the context of a three-dimensional tetrahedral advancing front parallel mesh generator shows that overall runtime improvements of 15 percent compared to common stop-and-repartition load balancing methods, 30 percent compared to explicit intrusive load balancing methods, and 42 percent compared to no load balancing are possible on large processor configurations. At the same time, the overheads attributable to the runtime system are a fraction of 1 percent of the total runtime. The parallel advancing front method is a coarse-grained and highly adaptive application and therefore exercises all of the features of the runtime system.