Locality, communication, and interconnect length in multicomputers
SIAM Journal on Computing
Time/space trade-offs for reversible computation
SIAM Journal on Computing
Nanosystems: molecular machinery, manufacturing, and computation
Nanosystems: molecular machinery, manufacturing, and computation
A true single-phase 8-bit adiabatic multiplier
Proceedings of the 38th annual Design Automation Conference
Predictive performance and scalability modeling of a large-scale application
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
System on Chip or System on Package?
IEEE Design & Test
A Decoupled Predictor-Directed Stream Prefetching Architecture
IEEE Transactions on Computers
Gilgamesh: a multithreaded processor-in-memory architecture for petaflops computing
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Overcoming the limitations of conventional vector processors
Proceedings of the 30th annual international symposium on Computer architecture
Reversibility for efficient computing
Reversibility for efficient computing
Reversible computer engineering and architecture
Reversible computer engineering and architecture
Analysis and Modeling of Advanced PIM Architecture Design Tradeoffs
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Concurrency and Computation: Practice & Experience - The High Performance Architectural Challenge: Mass Market versus Proprietary Components?
International Journal of High Performance Computing Applications
A system architecture solution for unreliable nanoelectronic devices
IEEE Transactions on Nanotechnology
Reversible logic for supercomputing
Proceedings of the 2nd conference on Computing frontiers
Task optimization based on CPU pipeline technique in a multicore system
Computers & Mathematics with Applications
Hi-index | 0.00 |
It seems well understood that supercomputer simulation is an enabler for scientific discoveries, weapons, and other activities of value to society. It also seems widely believed that Mooreýs Law will make progressively more powerful supercomputers over time and thus enable more of these contributions. This paper seeks to add detail to these arguments, revealing them to be generally correct but not a smooth and effortless progression. This paper will review some key problems that can be solved with supercomputer simulation, showing that more powerful supercomputers will be useful up to a very high yet finite limit of around 1021 FLOPS (1 Zettaflops . The review will also show the basic nature of these extreme problems. This paper will review work by others showing that the theoretical maximum supercomputer power is very high indeed, but will explain how a straightforward extrapolation of Mooreýs Law will lead to technological maturity in a few decades. The power of a supercomputer at the maturity of Mooreýs Law will be very high by todayýs standards at 1016-1019 FLOPS (100 Petaflops to 10 Exaflops, depending on architecture , but distinctly below the level required for the most ambitious applications. Having established that Mooreýs Law will not be that last word in supercomputing, this paper will explore the nearer term issue of what a supercomputer will look like at maturity of Mooreýs Law. Our approach will quantify the maximum performance as permitted by the laws of physics for extension of current technology and then find a design that approaches this limit closely. We study a "multi-architecture" for supercomputers that combines a microprocessor with other "advanced" concepts and find it can reach the limits as well. This approach should be quite viable in the future because the microprocessor would provide compatibility with existing codes and programming styles while the "advanced" features would provide a boost to the limits of performance.