Lessons learned from moving earth system grid data sets over a 20 Gbps wide-area network

  • Authors:
  • Raj Kettimuthu;Alex Sim;Dan Gunter;Bill Allcock;Peer-Timo Bremer;John Bresnahan;Andrew Cherry;Lisa Childers;Eli Dart;Ian Foster;Kevin Harms;Jason Hick;Jason Lee;Michael Link;Jeff Long;Keith Miller;Vijaya Natarajan;Valerio Pascucci;Ken Raffenetti;David Ressman;Dean Williams;Loren Wilson;Linda Winkler

  • Affiliations:
  • Argonne National Laboratory, Argonne, IL and The University of Chicago, Chicago, IL;Lawrence Berkeley National Laboratory, Berkeley, CA;Lawrence Berkeley National Laboratory, Berkeley, CA;Argonne National Laboratory, Argonne, IL;Data Direct Networks, Chatsworth, CA;Argonne National Laboratory, Argonne, IL and The University of Chicago, Chicago, IL;Argonne National Laboratory, Argonne, IL;Argonne National Laboratory, Argonne, IL and The University of Chicago, Chicago, IL;Lawrence Berkeley National Laboratory, Berkeley, CA;Argonne National Laboratory, Argonne, IL and The University of Chicago, Chicago, IL;Argonne National Laboratory, Argonne, IL;Lawrence Berkeley National Laboratory, Berkeley, CA;Lawrence Berkeley National Laboratory, Berkeley, CA;Argonne National Laboratory, Argonne, IL and The University of Chicago, Chicago, IL;Lawrence Livermore National Laboratory, Livermore, CA;Data Direct Networks, Chatsworth, CA;Lawrence Berkeley National Laboratory, Berkeley, CA;The University of Utah, Salt Lake City, UT;Argonne National Laboratory, Argonne, IL;Argonne National Laboratory, Argonne, IL;Lawrence Livermore National Laboratory, Livermore, CA;Argonne National Laboratory, Argonne, IL;Argonne National Laboratory, Argonne, IL

  • Venue:
  • Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In preparation for the Intergovernmental Panel on Climate Change (IPCC) Fifth Assessment Report, the climate community will run the Coupled Model Intercomparison Project phase 5 (CMIP-5) experiments, which are designed to answer crucial questions about future regional climate change and the results of carbon feedback for different mitigation scenarios. The CMIP-5 experiments will generate petabytes of data that must be replicated seamlessly, reliably, and quickly to hundreds of research teams around the globe. As an end-to-end test of the technologies that will be used to perform this task, a multi-disciplinary team of researchers moved a small portion (10 TB) of the multimodel Coupled Model Intercomparison Project, Phase 3 data set used in the IPCC Fourth Assessment Report from three sources---the Argonne Leadership Computing Facility (ALCF), Lawrence Livermore National Laboratory (LLNL) and National Energy Research Scientific Computing Center (NERSC)---to the 2009 Supercomputing conference (SC09) show floor in Portland, Oregon, over circuits provided by DOE's ESnet. The team achieved a sustained data rate of 15 Gb/s on a 20 Gb/s network. More important, this effort provided critical feedback on how to deploy, tune, and monitor the middleware that will be used to replicate the upcoming petascale climate datasets. We report on obstacles overcome and the key lessons learned from this successful bandwidth challenge effort.