Diagnosing and debugging performance problems on complex distributed systems requires end-to-end performance information at both the application and system levels. We describe a methodology, called NetLogger, that enables real-time diagnosis of performance problems in such systems. The methodology includes tools for generating precision event logs, an interface to a system event-monitoring framework, and tools for visualizing the log data and the real-time state of the distributed system. Low overhead is an important requirement for such tools; therefore, we evaluate the efficiency of the monitoring itself. The approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system.