MegaProto: A Low-Power and Compact Cluster for High-Performance Computing

  • Authors:
  • Hiroshi Nakashima;Hiroshi Nakamura;Mitsuhisa Sato;Taisuke Boku;Satoshi Matsuoka;Daisuke Takahashi;Yoshihiko Hotta

  • Affiliations:
  • Toyohashi University of Technology;University of Tokyo;University of Tsukuba;University of Tsukuba;Tokyo Institute of Technology;University of Tsukuba;University of Tsukuba

  • Venue:
  • IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 11 - Volume 12
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

"MegaProto" is a proof-of-concept prototype for our project "Mega-Scale Computing Based on Low-Power Technology and Workload Modeling", implementing our key idea that a million-scale parallel system should be built with densely mounted low-power commodity processors. It also serves as a platformto implement and evaluate our new technologies such as power conscious compilation, highly reliable and high performance networking, highly dependable cluster management, and multi-level scalable parallel programming. The building block of the MegaProto is a 1U-high 19 inch-rack mountable motherboard unit on which 16 low-power, one-dollar note-sized, commodity PCarchitecture daughterboards are mounted with a high bandwidth, 2Gbps per processor network based on Gigabit Ethernet. The peak performance of each unit is 14.4GFlops for the first version and will improve to 38.4GFlops in the second version through a processor/daughterboard upgrade. The intra- and inter-unit network bandwidths are 32Gbps and 16Gbps respectively. As for power consumption, the entire unit idles at less than 150W and consumes 300-330W maximum under extreme computational stress; this is comparable to or better than conventional 1U servers comprised of dual high-performance, power hungry processors, while benchmarks exhibit up to 279% superior performance for some NPB programs. This demonstrates that higher performance can be achieved with low-power, densely populated architectures with commodity components.