Will Moore's Law Be Sufficient?

Authors:
Erik P. DeBenedictis
Affiliations:
Sandia National Laboratories
Venue:
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Year:
2004

Abstract

It seems well understood that supercomputer simulation is an enabler for scientific discoveries, weapons, and other activities of value to society. It also seems widely believed that Moore's Law will make progressively more powerful supercomputers over time and thus enable more of these contributions. This paper seeks to add detail to these arguments, revealing them to be generally correct, although the progression will not be smooth and effortless. This paper will review some key problems that can be solved with supercomputer simulation, showing that more powerful supercomputers will be useful up to a very high yet finite limit of around 10^21 FLOPS (1 Zettaflops). The review will also show the basic nature of these extreme problems. This paper will review work by others showing that the theoretical maximum supercomputer power is very high indeed, but will explain how a straightforward extrapolation of Moore's Law will lead to technological maturity in a few decades. The power of a supercomputer at the maturity of Moore's Law will be very high by today's standards at 10^16-10^19 FLOPS (100 Petaflops to 10 Exaflops, depending on architecture), but distinctly below the level required for the most ambitious applications. Having established that Moore's Law will not be the last word in supercomputing, this paper will explore the nearer-term issue of what a supercomputer will look like at the maturity of Moore's Law. Our approach will quantify the maximum performance permitted by the laws of physics for an extension of current technology and then find a design that approaches this limit closely. We study a "multi-architecture" for supercomputers that combines a microprocessor with other "advanced" concepts and find that it can reach the limits as well. This approach should be quite viable in the future because the microprocessor would provide compatibility with existing codes and programming styles, while the "advanced" features would provide a boost to the limits of performance.