Server-directed collective I/O in Panda

  • Authors:
  • K. E. Seamons;Y. Chen;P. Jones;J. Jozwiak;M. Winslett

  • Affiliations:
  • Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois;Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois;Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois;Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois;Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois

  • Venue:
  • Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
  • Year:
  • 1995

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present the architecture and implementation results for Panda 2.0, a library for input and output of multidimensional arrays on parallel and sequential platforms. Panda achieves remarkable performance levels on the IBM SP2, showing excellent scalability as data size increases and as the number of nodes increases, and provides throughputs close to the full capacity of the AIX file system on the SP2 we used. We argue that this good performance can be traced to Panda's use of server-directed i/o (a logical-level version of disk-directed i/o [Kotz94b]) to perform array i/o using sequential disk reads and writes, a very high level interface for collective i/o requests, and built-in facilities for arbitrary rearrangements of arrays during i/o. Other advantages of Panda's approach are ease of use, easy application portability, and a reliance on commodity system software.