Transparent log-based data storage in MPI-IO applications

  • Authors:
  • Dries Kimpe;Rob Ross;Stefan Vandewalle;Stefaan Poedts

  • Affiliations:
  • Technisch-Wetenschappelijk Rekenen, K.U. Leuven, Leuven, België and Centrum voor Plasma-Astrofysica, K.U.Leuven, Leuven, België;Argonne National Laboratory, Argonne, IL;Technisch-Wetenschappelijk Rekenen, K.U. Leuven, Leuven, België;Centrum voor Plasma-Astrofysica, K.U.Leuven, Leuven, België

  • Venue:
  • PVM/MPI'07 Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

The MPI-IO interface is a critical component in I/O software stacks for high-performance computing, and many successful optimizations have been incorporated into implementations to help provide high performance I/O for a variety of access patterns. However, in spite of these optimizations, there is still a large performance gap between "easy" access patterns and more difficult ones, particularly when applications are unable to describe I/O using collective calls. In this paper we present LogFS, a component that implements logbased storage for applications using the MPI-IO interface. We first discuss how this approach allows us to exploit the temporal freedom present in the MPI-IO consistency semantics, allowing optimization of a variety of access patterns that are not well-served by existing approaches. We then describe how this component is integrated into the ROMIO MPI-IO implementation as a stackable layer, allowing LogFS to be used on any file system supported by ROMIO. Finally we show performance results comparing the LogFS approach to current practice using a variety of benchmarks.