Monitoring distributed systems
ACM Transactions on Computer Systems (TOCS)
Future Generation Computer Systems - Special issue on metacomputing
GridRM: A Resource Monitoring Architecture for the Grid
GRID '02 Proceedings of the Third International Workshop on Grid Computing
Proceedings of the Seventh International Conference on Data Engineering
IC2D: Interactive Control and Debugging of Distribution
LSSC '01 Proceedings of the Third International Conference on Large-Scale Scientific Computing-Revised Papers
A Resource Management Architecture for Metacomputing Systems
IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
OMIS 2.0 - A Universal Interface for Monitoring Systems
Proceedings of the 4th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A Directory Service for Configuring High-Performance Distributed Computations
HPDC '97 Proceedings of the 6th IEEE International Symposium on High Performance Distributed Computing
Autopilot: Adaptive Control of Distributed Applications
HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
Monitoring distributed systems: a relational approach
Monitoring distributed systems: a relational approach
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
International Journal of High Performance Computing Applications
The GrADS Project: Software Support for High-Level Grid Application Development
International Journal of High Performance Computing Applications
The Tau Parallel Performance System
International Journal of High Performance Computing Applications
Automatic Grid workflow based on imperative programming languages: Research Articles
Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
Concurrency and Computation: Practice & Experience
ASKALON: A Grid Application Development and Computing Environment
GRID '05 Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
Job monitoring and steering in D-Grid's High Energy Physics Community Grid
Future Generation Computer Systems
Future Generation Computer Systems
GridICE: a monitoring service for Grid systems
Future Generation Computer Systems - Special issue: High-speed networks and services for data-intensive grids: The DataTAG project
A taxonomy of grid monitoring systems
Future Generation Computer Systems
Debugging a Distributed Computing System
IEEE Transactions on Software Engineering
Towards autonomic detection of SLA violations in Cloud infrastructures
Future Generation Computer Systems
DARGOS: A highly adaptable and scalable monitoring architecture for multi-tenant Clouds
Future Generation Computer Systems
Hi-index | 0.00 |
We present the design and implementation of a general task monitoring and steering system for Grid applications (GSTAT). The system is integrated in the GRID superscalar (GRIDSs) programming framework. Information at the application, Grid node, and individual task levels are supplied upon request. Using the steering capabilities, individual tasks or the whole application can be cancelled. The corresponding jobs can be restarted using fault tolerance and checkpointing capabilities based on GRIDSs. In addition, the computational resources assigned to an application can be modified. GSTAT is tested using high throughput and high performance computing cases on an Internet-based Grid of computers.