The interaction of parallel and sequential workloads on a network of workstations
Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
GLUnix: a global layer Unix for a network of workstations
Software—Practice & Experience - Special issue on multiprocessor operating systems
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
A scalable SNMP-based distibuted monitoring system for heterogeneous network computing
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
DNS and BIND
Distributed management by delegation
ICDCS '95 Proceedings of the 15th International Conference on Distributed Computing Systems
A survey of peer-to-peer content distribution technologies
ACM Computing Surveys (CSUR)
Designing the Architecture of P2P-Based Network Management Systems
ISCC '06 Proceedings of the 11th IEEE Symposium on Computers and Communications
Improving network management with mobile agents in peer-to-peer networks
Proceedings of the 2008 ACM symposium on Applied computing
Hi-index | 0.00 |
We present PeerMon, a peer-to-peer resource monitoring system for general purpose Unix local area network (LAN) systems. PeerMon is designed to monitor system resources on a single LAN, but it also could be deployed on several LANs where some inter-LAN resource sharing is supported. Its peer-to-peer design makes Peer-Mon a scalable and fault tolerant monitoring system for efficiently collecting system-wide resource usage information. Experiments evaluating PeerMon's performance show that it adds little additional overhead to the system and that it scales well to large-sized LANs. Peer-Mon was initially designed to be used by system services that provide load balancing and job placement, however, it can be easily extended to providemonitoring data for other system-wide services. We present three tools (smarterSSH, autoMPIgen, and a dynamic DNS binding system) that use PeerMon data to pick "good" nodes for job or process placement in a LAN. Tools using PeerMon data for job placement can greatly improve the performance of applications running on general purpose LANs. We present results showing application speed-ups of up to 4.6 using our tools.