NVisionCC: a visualization framework for high performance cluster security
Proceedings of the 2004 ACM workshop on Visualization and data mining for computer security
MRNet: A Software-Based Multicast/Reduction Network for Scalable Tools
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
IMPuLSE: integrated monitoring and profiling for large-scale environments
LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
InteMon: continuous mining of sensor data in large-scale self-infrastructures
ACM SIGOPS Operating Systems Review
PARSE: A Tool for Parallel Application Run Time Sensitivity Evaluation
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
Job Centric Cluster Monitoring
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
The Node Monitoring Component of a Scalable Systems Software Environment
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
InteMon: intelligent system monitoring on large clusters
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Intelligent system monitoring on large clusters
DMSN '06 Proceedings of the 3rd workshop on Data management for sensor networks: in conjunction with VLDB 2006
Holistic aggregate resource environment
ACM SIGOPS Operating Systems Review
Lessons learned at 208K: towards debugging millions of cores
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Client-Centric Performance Analysis of a High-Availability Cluster
ISAS '07 Proceedings of the 4th international symposium on Service Availability
Observing Performance Dynamics Using Parallel Profile Snapshots
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
A mechanism of automated monitoring deployment in grid environment
Proceedings of the 5th International ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness
Scalable data center provisioning and control
IBM Journal of Research and Development
Tree-based overlay networks for scalable applications
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
A database-centric approach to system managemant in the blue gene/L supercomputer
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
An efficient management and automatic failover on a large-scale cluster monitoring system
ICOSSSE '09 Proceedings of the 8th WSEAS international conference on System science and simulation in engineering
A network performance sensitivity metric for parallel applications
International Journal of High Performance Computing and Networking
A flexible architecture integrating monitoring and analytics for managing large-scale data centers
Proceedings of the 8th ACM international conference on Autonomic computing
TAUmon: scalable online performance data analysis in TAU
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
CGSV: an adaptable stream-integrated grid monitoring system
NPC'05 Proceedings of the 2005 IFIP international conference on Network and Parallel Computing
TA UoverSupermon: low-overhead online parallel performance monitoring
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
A network performance sensitivity metric for parallel applications
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
Hi-index | 0.00 |
Supermon is a exible set of tools for high speed, scalable cluster monitoring. Node behavior can be monitored much faster than with other commonly used methods (e.g., rstatd). In addition, Supermon uses a data protocol based on symbolic expressions (S-expressions) at all levels of Supermon, from individual nodes to entire clusters. This contributes to Supermon's scalability and allows it to function in a heterogeneous environment. This paper presents the Supermon architecture and discuss initial performance measurements on a cluster of heterogeneous Alpha-processor based nodes.