Paravirtualization effect on single- and multi-threaded memory-intensive linear algebra software

  • Authors:
  • Lamia Youseff;Keith Seymour;Haihang You;Dmitrii Zagorodnov;Jack Dongarra;Rich Wolski

  • Affiliations:
  • Dept. of Computer Science, University of California, Santa Barbara, USA;Dept. of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, USA;Dept. of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, USA;Dept. of Computer Science, University of California, Santa Barbara, USA;Dept. of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, USA;Dept. of Computer Science, University of California, Santa Barbara, USA

  • Venue:
  • Cluster Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Previous studies have revealed that paravirtualization imposes minimal performance overhead on High Performance Computing (HPC) workloads, while exposing numerous benefits for this field. In this study, we are investigating the impact of paravirtualization on the performance of automatically-tuned software systems. We compare peak performance, performance degradation in constrained memory situations, performance degradation in multi-threaded applications, and inter-VM shared memory performance. For comparison purposes, we examine the proficiency of ATLAS, a quintessential example of an autotuning software system, in tuning the BLAS library routines for paravirtualized systems. Our results show that the combination of ATLAS and Xen paravirtualization delivers native execution performance and nearly identical memory hierarchy performance profiles in both single and multi-threaded scenarios. Furthermore, we show that it is possible to achieve memory sharing among OS instances at native speeds. These results expose new benefits to memory-intensive applications arising from the ability to slim down the guest OS without influencing the system performance. In addition, our findings support a novel and very attractive deployment scenario for computational science and engineering codes on virtual clusters and computational clouds.