AgentTeamwork: Coordinating grid-computing jobs with mobile agents

  • Authors:
  • Munehiro Fukuda;Koichi Kashiwagi;Shinya Kobayashi

  • Affiliations:
  • Computing & Software Systems, University of Washington, Bothell;Computer Science, Ehime University, Japan;Computer Science, Ehime University, Japan

  • Venue:
  • Applied Intelligence
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

AgentTeamwork is a grid-computing middleware system that dispatches a collection of mobile agents to coordinate a user job over remote computing nodes in a decentralized manner. Its utmost focus is to maintain high availability and dynamic balancing of distributed computing resources to a parallel-computing job. For this purpose, a mobile agent is assigned to each process engaged in the same job, monitors its execution at a different machine, takes its periodical execution snapshot, moves it to a lighter-loaded machine, and resumes it from the latest snapshot upon an accidental crash. The system also restores broken inter-process communication involved in the same job using its error-recoverable socket and mpiJava libraries in collaboration among mobile agents.We have implemented the first version of our middleware including a mobile agent execution platform, error-recoverable socket and mpiJava API libraries, a job wrapper program, and several types of mobile agents such as commander, resource, sentinel, and bookkeeper agents, each orchestrating, allocating resources to, monitoring and maintaining snapshots of a user process respectively. This paper presents AgentTeamwork's execution model, its implementation techniques, and our performance evaluation using the Java Grande benchmark test programs.