Decentralizing control and intelligence in network management
Proceedings of the fourth international symposium on Integrated network management IV
PARMON: a portable and scalable monitoring system for clusters
Software—Practice & Experience
A scalable SNMP-based distibuted monitoring system for heterogeneous network computing
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
ClusterProbe: An Open, Flexible and Scalable Cluster Monitoring Tool
IWCC '99 Proceedings of the 1st IEEE Computer Society International Workshop on Cluster Computing
Hi-index | 0.00 |
Fast real-time monitoring of system information is important to the understanding of parallel system especially for a large cluster system that appeared recently. Making the system fast and scalable at the same time is still a challenging task. This paper presents the design and implementation of a fast and real time monitoring system called SCMS/RMS. This system is a part of more comprehensive cluster management tool called SCMS. SCMS/RMS is designed to be flexible, highly scalable, and efficient. Many techniques that are used to increase the monitoring speed and to achieve high scalability have been described in this paper. The experiment has been conducted on a 72 nodes Beowulf Cluster and the results show that SCMS/RMS is very fast and highly scalable.