When a computational system grows from a single machine to a Grid with potentially thousands of heterogeneous nodes, the interdependencies among its resources and software components make management and maintenance much more complicated. One of the most important challenges is balancing system maintenance against global system availability. In this paper, we propose a novel mechanism, the Cobweb Guardian, which not only reduces the effects of maintenance but also removes the effects that deployment dependencies, invocation dependencies, and environment dependencies have on system availability. With the Cobweb Guardian, Grid administrators can execute maintenance tasks safely at runtime while ensuring high system availability. Our evaluation results show that the proposed dependency-aware maintenance mechanism can significantly increase the throughput and availability of the whole system at runtime.