PIM Architectures to Support Petaflops Level Computation in the HTMT Machine

Authors:
Peter M. Kogge; Jay B. Brockman; Vincent W. Freeh
Venue:
IWIA '99 Proceedings of the 1999 International Workshop on Innovative Architecture
Year:
1999

Abstract

The HTMT project is an ambitious attempt to combine a variety of emerging technologies into a petaflops-level computing system that would be available many years before an equivalent machine could be built from current technologies. One of the key problems in such an architecture is overcoming the latencies between main memory and the high-performance CPUs, which can grow to tens of thousands of cycles. HTMT addresses this with a multi-level memory system in which most of the levels are to be fabricated using Processing-In-Memory (PIM) technologies, in architectures that actively manage the flow of data without centralized CPU control. This paper gives an overview of the current architecture for such chips within the context of the HTMT system and describes how this architecture supports the expected execution model.