Gilgamesh: a multithreaded processor-in-memory architecture for petaflops computing

Authors:
Thomas L. Sterling; Hans P. Zima
Affiliations:
California Institute of Technology, Pasadena, California; California Institute of Technology, Pasadena, California and University of Vienna, Austria
Venue:
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Year:
2002

Abstract

Processor-in-Memory (PIM) architectures avoid the von Neumann bottleneck of conventional machines by integrating high-density DRAM and CMOS logic on the same chip. Parallel systems based on this new technology are expected to provide greater scalability, adaptability, robustness, and fault tolerance, along with lower power consumption, than current MPPs or commodity clusters. In this paper we describe the design of Gilgamesh, a PIM-based massively parallel architecture, and elements of its execution model. Gilgamesh extends existing PIM capabilities by incorporating advanced mechanisms for virtualizing tasks and data and by providing adaptive resource management for load balancing and latency tolerance. The Gilgamesh execution model is based on macroservers, a middleware layer that supports object-based runtime management of data and threads, allowing explicit and dynamic control of locality and load balancing. The paper concludes with a discussion of related research activities and an outlook on future work.