Using active NVRAM for I/O staging

  • Authors:
  • Sudarsun Kannan;Ada Gavrilovska;Karsten Schwan;Dejan Milojicic;Vanish Talwar

  • Affiliations:
  • Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA;HP Labs, Palo Alto, CA, USA;HP Labs, Palo Alto, CA, USA

  • Venue:
  • Proceedings of the 2nd international workshop on Petascal data analytics: challenges and opportunities
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

With HPC machines moving to the exascale, scaling the I/O performance of applications is a well known problem. Also, a closely related problem is, how to efficiently analyze and extract useful information from I/O data, viz. data post processing. With advent of nonvolatile memory technologies (NVMs) like SSD, PCM and Memristor, research has been focusing on how to improve the file systems performance and optimizations to overcome disk latencies. In the other end, there has been extensive focus on 'DataStaging' or 'in situ' I/O processing where I/O data are moved from computational cores to memory buffers of dedicated data processing or staging nodes using high performance I/O channels. The I/O data gets processed in these nodes before writing them to persistent storage like disks. However, issues with such approaches include (1) the limitation that they cannot easily analyze temporal data relationships or characteristics embedded in multiple simulation output steps, due to the limited aggregate memory capacity of staging nodes, and (2) the need to 'right size' such staging memory, sometimes even for single output/checkpoint steps when data volumes are large. Failing to properly allocate staging memory buffers (2) can cause applications to block and severely degrade the performance improvements sought by the extensive parallelization efforts undertaken by application developers. The limitation posed by (1) can degrade the utility of the Staging approach seen by end users. This paper explores an alternative solution for 'right memory sizing' issue for staging I/O. In this solution, memory scaling avoids the cost and power constraints imposed on machine designers by the use of DRAM (memory), by instead, using active NVRAM(nonvolatile memory) to enhance the memory capacities of compute and staging nodes. Active NVRAMs are node-local NVRAMs that are embedded with a low power system-on-chip compute element. We propose a mechanism, in which each physical node has an additional active NVRAM component to stage I/O and apply simple data analytics operations over the I/O data. While such node local data storage provides an obvious I/O acceleration, our experimental results show the effectiveness of our approach in addressing 'right memory sizing issue' by efficient I/O data processing. We also discuss the overheads in using Active NVRAM based approach for I/O staging.