Design and Implementation of an RPC-Based ARC Kernel

  • Authors:
  • L. Aruna;Yamini Sharma;Rushikesh K. Joshi

  • Affiliations:
  • -;-;-

  • Venue:
  • HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Anonymous Remote Computing (ARC) is a programming paradigm for parallel and distributed computing on workstation clusters. Workstation clusters are characterized by heterogeneity, node/link failures and changing loads. Typically, a parallel program may not have any control over the changing load patterns. Stealing idle cycles on such systems require that parallel programs should adapt themselves dynamically to changing load patterns. We present a design and implementation of an RPC-based ARC kernel supporting parallel programming through ARC Function Calls in such an environment. ARC Function Calls in a C program are executed on anonymous remote machines making the distribution transparent to the parallel programmer. A Horse Power Factor (HPF) primitive characterizes load and speed for the use of task distribution in a parallel program. The kernel supports fault tolerance by awarding failed tasks to available nodes. Nodes can join and leave dynamically at any time during execution. The kernel was designed using object oriented techniques and implemented as a collection of collaborating RPC servers running on a a Linux cluster. The performance and overheads of implementation have also been discussed.