Evaluating I/O characteristics and methods for storing structured scientific data

  • Authors:
  • Avery Ching;Alok Choudhary;Wei-keng Liao;Lee Ward;Neil Pundit

  • Affiliations:
  • Northwestern University, Department of EECS, Evanston, IL;Northwestern University, Department of EECS, Evanston, IL;Northwestern University, Department of EECS, Evanston, IL;Sandia National Laboratories, Scalable Computer Systems Department, Albuquerque, NM;Sandia National Laboratories, Scalable Computer Systems Department, Albuquerque, NM

  • Venue:
  • IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many large-scale scientific simulations generate large, structured multi-dimensional datasets. Data is stored at various intervals on high performance I/O storage systems for checkpointing, post-processing, and visualization. Data storage is very I/O intensive and can dominate the overall running time of an application, depending on the characteristics of the I/O access pattern. Our NCIO benchmark determines how I/O characteristics greatly affect performance (up to 2 orders of magnitude) and provides scientific application developers with guidelines for improvement. In this paper, we examine the impact of various I/O parameters and methods when using the MPI-IO interface to store structured scientific data in an optimized parallel file system.