The Hector Distributed Run-Time Environment

Authors:
Samuel H. Russ;Jonathan Robinson;Brian K. Flachs;Bjørn Heckel
Affiliations:
-;-;-;-
Venue:
IEEE Transactions on Parallel and Distributed Systems
Year:
1998

Citing 14
Cited 10

Monitors, messages, and clusters: the p4 parallel programming system

Parallel Computing - Special issue: message passing interfaces
PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing

PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
Hector: An Agent-Based Architecture for Dynamic Resource Management

IEEE Concurrency
Visualization and Debugging in a Heterogeneous Environment

Computer
Software-Based Replication for Fault Tolerance

Computer
Hector: Automated Task Allocation for MPI

IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Using Runtime Measured Workload Characteristics in Parallel Processor Scheduling

IPPS '96 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Managing Checkpoints for Parallel Programs

IPPS '96 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
A Historical Application Profiler for Use by Parallel Schedulers

IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Theory and Practice in Parallel Job Scheduling

IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Portable checkpointing and recovery

HPDC '95 Proceedings of the 4th IEEE International Symposium on High Performance Distributed Computing
A Task Migration Implementation of the Message-Passing Interface

HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
Memory Space Representation for Heterogeneous Network Process Migration

IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Utilization and Predictability in Scheduling the IBM SP2 with Backfilling

IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium

Exploiting Fine-Grained Idle Periods in Networks of Workstations

IEEE Transactions on Parallel and Distributed Systems
Hector: An Agent-Based Architecture for Dynamic Resource Management

IEEE Concurrency
Coscheduling under Memory Constraints in a NOW Environment

JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Implementing and Analysing an Effective Explicit Coscheduling Algorithm on a NOW

VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
ATOP-space and time adaptation for parallel and grid applications via flexible data partitioning

ARM '04 Proceedings of the 3rd workshop on Adaptive and reflective middleware
On the Scalability of Centralized Control

IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 18 - Volume 19
Design and Implementation of Multiple Fault-Tolerant MPI over Myrinet (M^3)

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Time and space adaptation for computational grids with the ATOP-Grid middleware

Future Generation Computer Systems
WE-AMBLE: a Workflow Engine To Manage Awareness in Collaborative Grid Environments

International Journal of High Performance Computing Applications
Performance evaluation of consistent recovery protocols using MPICH-GF

EDCC'05 Proceedings of the 5th European conference on Dependable Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Harnessing the computational capabilities of a network of workstations promises to off-load work from overloaded supercomputers onto largely idle resources overnight. Several capabilities are needed to do this, including support for an architecture-independent parallel programming environment, task migration, automatic resource allocation, and fault tolerance. The Hector distributed run-time environment is designed to present these capabilities transparently to programmers. MPI programs can be run under this environment on homogeneous clusters with no modifications to their source code needed. The design of Hector, its internal structure, and several benchmarks and tests are presented.