Efficient dispersal of information for security, load balancing, and fault tolerance
Journal of the ACM (JACM)
A bridging model for parallel computation
Communications of the ACM
The available capacity of a privately owned workstation environment
Performance Evaluation
Lua—an extensible extension language
Software—Practice & Experience
Unsupervised learning
Feasibility of a serverless distributed file system deployed on an existing set of desktop PCs
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Peer-to-Peer Membership Management for Gossip-Based Protocols
IEEE Transactions on Computers
Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems
Middleware '01 Proceedings of the IFIP/ACM International Conference on Distributed Systems Platforms Heidelberg
MPICH-V: toward a scalable fault tolerant MPI for volatile nodes
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
MPICH-G2: a Grid-enabled implementation of the Message Passing Interface
Journal of Parallel and Distributed Computing - Special issue on computational grids
Faults in Grids: Why are they so bad and What can be done about it?
GRID '03 Proceedings of the 4th International Workshop on Grid Computing
Distributed computing in practice: the Condor experience: Research Articles
Concurrency and Computation: Practice & Experience - Grid Performance
Concurrency and Computation: Practice & Experience - Middleware for Grid Computing
The implementation of the BSP parallel computing model on the InteGrade Grid middleware
MGC '05 Proceedings of the 3rd international workshop on Middleware for grid computing
SBAC-PAD '05 Proceedings of the 17th International Symposium on Computer Architecture on High Performance Computing
Trust in large-scale computational grids: an SPKI/SDSI extension for representing opinion
Proceedings of the 4th international workshop on Middleware for grid computing
Grid Computing: The New Frontier of High Performance Computing
Grid Computing: The New Frontier of High Performance Computing
Design and Implementation of a Middleware for Data Storage in Opportunistic Grids
CCGRID '07 Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid
A group membership service for large-scale grids
Proceedings of the 6th international workshop on Middleware for grid computing
Resource use pattern analysis for opportunistic grids
Proceedings of the 6th international workshop on Middleware for grid computing
Cluster Analysis
Grid Computing: Techniques and Applications
Grid Computing: Techniques and Applications
A gossip-style failure detection service
Middleware '98 Proceedings of the IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing
Resource use pattern analysis for predicting resource availability in opportunistic grids
Concurrency and Computation: Practice & Experience - Advanced Scheduling Strategies and Grid Programming Environments
Generating parallel algorithms for cluster and grid computing
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
Hi-index | 0.00 |
The InteGrade project is a multi-university effort to build a novel grid computing middleware based on the opportunistic use of resources belonging to user workstations. The InteGrade middleware currently enables the execution of sequential, bag-of-tasks, and parallel applications that follow the BSP or the MPI programming models. This article presents the lessons learned over the last five years of the InteGrade development and describes the solutions achieved concerning the support for robust application execution. The contributions cover the related fields of application scheduling, execution management, and fault tolerance. We present our solutions, describing their implementation principles and evaluation through the analysis of several experimental results.