Developers and users of high-performance distributed systems often observe performance problems such as unexpectedly low throughput or high latency. Determining the source of these problems requires detailed end-to-end instrumentation of all components, including the applications, operating systems, hosts, and networks. However, the instrumentation must be designed to have very low overhead so that it does not perturb the system being monitored. In this paper we present a lightweight instrumentation system that can be dynamically activated to unobtrusively collect and aggregate detailed end-to-end monitoring information from distributed applications. We also show how emerging "Web Services" can be used to facilitate remote interaction with this system.
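
To make the idea of dynamically activated, low-overhead instrumentation concrete, the following is a minimal sketch and not the system presented in the paper. The class `Instrumenter`, its methods, and the `transfer` function are hypothetical names chosen for illustration. The sketch assumes that events are buffered in memory and that activation is a simple in-process flag, whereas the paper describes remote activation.

```python
import time


class Instrumenter:
    """Hypothetical event logger that can be switched on and off at run time."""

    def __init__(self, clock=time.time):
        self.enabled = False  # off by default, so disabled calls stay cheap
        self.events = []      # in-memory buffer (assumption: a real system would forward these)
        self._clock = clock

    def activate(self):
        self.enabled = True

    def deactivate(self):
        self.enabled = False

    def log(self, name, **fields):
        # Fast path: return before timestamping or building the event record.
        if not self.enabled:
            return
        self.events.append({"ts": self._clock(), "event": name, **fields})


def transfer(inst, nbytes):
    """Stand-in for an application operation bracketed by start/end events."""
    inst.log("transfer.start", bytes=nbytes)
    # ... the real work would happen here ...
    inst.log("transfer.end", bytes=nbytes)


inst = Instrumenter()
transfer(inst, 1024)  # instrumentation off: nothing is recorded
inst.activate()
transfer(inst, 2048)  # instrumentation on: start and end events are recorded
inst.deactivate()
```

Only the second call to `transfer` leaves events in the buffer. While the flag is off, each `log` call does little more than test a boolean, which is one simple way to keep instrumentation from perturbing the monitored application.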