Modeling resource usage in planetary-scale shared infrastructures: PlanetLab's case study

  • Authors:
  • Xavier León;Tuan Anh Trinh;Leandro Navarro

  • Affiliations:
  • Distributed Systems Group, Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Barcelona, Spain;Network Economics Group, Department of Telecommunications and Media Informatics, Budapest University of Technologies and Economics, Hungary;Distributed Systems Group, Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Barcelona, Spain

  • Venue:
  • Computer Networks: The International Journal of Computer and Telecommunications Networking
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Understanding how different applications with different resource requirements interact in a shared infrastructure like Internet is crucial in the design of resource allocation and regulation policies. In this paper, our focus is on analyzing a well known planetary scale experimental facility as a case study - PlanetLab. Previous analysis focused on the workload characterization from the node point of view, while little attention has been paid to the processes leading to such workload - i.e. distributed applications. In particular, our aim is twofold: (i) characterize the underlying nature of interactions among distributed applications in a collaborative and shared infrastructure. In other words, understand the interference or mutual influence of one slice on other slices and the effect on the overall consumption of slices over the infrastructure, and (ii) characterize two important aspects of applications' resource usage - i.e. the short-term distribution and temporal dynamics of CPU, memory and network usage made by experimental applications currently running in PlanetLab. Understanding these interactions is a prerequisite for the design of new scheduling and regulation mechanisms for the experimental facility and serve as a testbed for future advances in resource scheduling facing the Future Internet. Based on the analysis of publicly available traces, we find that the distribution of resource usage is highly skewed - with a few applications producing most of the resource usage. We use our findings to develop a model that produces good matches in the metrics studied, allowing us to generate a workload with similar statistical characteristics.