The impact of noise on the scaling of collectives: a theoretical approach

  • Authors:
  • Saurabh Agarwal;Rahul Garg;Nisheeth K. Vishnoi

  • Affiliations:
  • IBM India Research Lab, Block-1, IIT Delhi, Hauz Khas, New Delhi, India;IBM India Research Lab, Block-1, IIT Delhi, Hauz Khas, New Delhi, India;IBM India Research Lab, Block-1, IIT Delhi, Hauz Khas, New Delhi, India

  • Venue:
  • HiPC'05 Proceedings of the 12th international conference on High Performance Computing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The performance of parallel applications running on large clusters is known to degrade due to the interference of kernel and daemon activities on individual nodes, often referred to as noise. In this paper, we focus on an important class of parallel applications, which repeatedly perform computation, followed by a collective operation such as a barrier. We model this theoretically and demonstrate, in a rigorous way, the effect of noise on the scalability of such applications. We study three natural and important classes of noise distributions: The exponential distribution, the heavy-tailed distribution, and the Bernoulli distribution. We show that the systems scale well in the presence of exponential noise, but the performance goes down drastically in the presence of heavy-tailed or Bernoulli noise.