Parallel I/O Subsystems in Massively Parallel Supercomputers

Authors:
Dror G. Feitelson;Peter F. Corbett;Sandra Johnson Baylor;Yarsun Hsu
Affiliations:
-;-;-;-
Venue:
IEEE Parallel & Distributed Technology: Systems & Technology
Year:
1995

Citing 17
Cited 6

A microprocessor-based hypercube supercomputer

IEEE Micro
Scale and performance in a distributed file system

ACM Transactions on Computer Systems (TOCS)
Computer architecture: a quantitative approach

Computer architecture: a quantitative approach
Distributed file systems: concepts and examples

ACM Computing Surveys (CSUR)
Input/output behavior of supercomputing applications

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Past, present, parallel: a survey of available parallel computer systems

Past, present, parallel: a survey of available parallel computer systems
The network architecture of the Connection Machine CM-5 (extended abstract)

SPAA '92 Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures
Architectural requirements of parallel scientific applications with explicit communication

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Parallel access to files in the Vesta file system

Proceedings of the 1993 ACM/IEEE conference on Supercomputing
RAID: high-performance, reliable secondary storage

ACM Computing Surveys (CSUR)
Extensible file system (ELFS): an object-oriented approach to high performance file I/O

OOPSLA '94 Proceedings of the ninth annual conference on Object-oriented programming systems, language, and applications
SP2 system architecture

IBM Systems Journal
Parallel file systems for the IBM SP computers

IBM Systems Journal
The Tera computer system

ICS '90 Proceedings of the 4th international conference on Supercomputing
The Concurrent Supercomputing Consortium: Year 1

IEEE Parallel & Distributed Technology: Systems & Technology
File-System Workload on a Scientific Multiprocessor

IEEE Parallel & Distributed Technology: Systems & Technology
MasPar Computer Corporation: The MasPar Scalable Unix I/O System

Proceedings of the 8th International Symposium on Parallel Processing

HFS: a performance-oriented flexible file system based on building-block compositions

ACM Transactions on Computer Systems (TOCS)
The impact of spatial layout of jobs on parallel I/O performance

Proceedings of the sixth workshop on I/O in parallel and distributed systems
An evaluation of Java's I/O capabilities for high-performance computing

Proceedings of the ACM 2000 conference on Java Grande
Compiler-Directed Collective-I/O

IEEE Transactions on Parallel and Distributed Systems
References

Sourcebook of parallel computing
The impact of spatial layout of jobs on I/O hotspots in mesh networks

Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

Applications on MPPs often require a high aggregate bandwidth of low-latency I/O to secondary storage. This requirement can met by internal parallel I/O subsystems that comprise dedicated I/O nodes, each with processor, memory, and disks.Massively parallel processors (MPPs), encompassing from tens to thousands of processors, are emerging as a major architecture for high-performance computers. Most major computer vendors offer computers with some degree of parallelism, and many smaller vendors specialize in producing MPPs. These machines are targeted for both grand-challenge problems and general-purpose computing.Like any computer, MPP architectural design must balance computation, memory bandwidth and capacity, communication capabilities, and I/O. In the past, most design research focused on the basic compute and communications hardware and software. This led to unbalanced computers that had relatively poor I/O performance. Recently, researchers have focused on designing hardware and software for I/O subsystems in MPPs. Consequently, most current MPPs have an architecture based on an internal parallel I/O subsystem (the "Architectures with parallel I/O" sidebar describes some examples). In these computers, this subsystem encompasses a collection of I/O nodes, each managing and providing I/O access to a set of disks. The I/O nodes connect to other nodes in the system by the same switching network that connects the compute nodes.In this article we'll examine why many MPPs use parallel I/O subsystems, what architecture is best for such a subsystem, and how to implement the subsystem. We'll also discuss how parallel file systems and their user interfaces can exploit the parallel I/O to provide enhanced services to applications.The systems discussed in this article are mostly tightly coupled distributed-memory MIMD (multiple-instruction, multiple-data) MPPs. In some cases, we also discuss shared-memory and SIMD (single-instruction, multiple-data) machines. We'll discuss three node types. Compute nodes are optimized to perform floating-point and numeric calculations, and have no local disk except perhaps for paging, booting, and operating-system software. I/O nodes contain the system's secondary storage, and provide the parallel file-system services. Gateway nodes provide connectivity to external data servers and mass-storage systems. In some cases, individual nodes can serve as more than one type. For example, the same nodes often handle I/O and gateway functions. The "Terminology" sidebar defines some other terms used in this article.