High Performance Threaded Data Streaming for Large Scale Simulations

  • Authors:
  • Viraj Bhat;Scott Klasky;Scott Atchley;Micah Beck;Doug McCune;Manish Parashar

  • Affiliations:
  • Princeton University, NJ/ Rutgers University, NJ;Princeton University, NJ;University of Tennessee, TN;University of Tennessee, TN;Princeton University, NJ;Rutgers University, NJ

  • Venue:
  • GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We have developed a threaded parallel data streaming approach using Logistical Networking (LN) to transfer multi-terabyte simulation data from computers at NERSC to our local analysis/visualization cluster, as the simulation executes, with negligible overhead. Data transfer experiments show that this concurrent data transfer approach is more favorable compared with writing to local disk and later transferring this data to be post-processed. Our algorithms are network aware, and can stream data at up to 97Mbs on a 100Mbs link from CA to NJ during a live simulation, using less than 5% CPU overhead at NERSC. This method is the first step in setting up a pipeline for simulation workflow and data management.