Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s

  • Authors:
  • Mathieu Luisier;Timothy B. Boykin;Gerhard Klimeck;Wolfgang Fichtner

  • Affiliations:
  • Purdue University, West Lafayette, IN;University of Alabama in Huntsville, Huntsville, AL;Purdue University, West Lafayette, IN;Integrated Systems Laboratory, ETH Zürich, Zürich, Switzerland

  • Venue:
  • Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a multi-dimensional, atomistic, quantum transport simulation approach to investigate the performances of realistic nanoscale transistors for various geometries and material systems. The central computation consists in solving the Schrödinger equation with open boundary conditions several thousand times. To do that, a Wave Function approach is used since it can be relatively easily parallelized. To further improve the computational efficiency, three additional levels of parallelization are identified, the work load is optimally balanced between the CPUs, computational interleaving is applied where possible, and a mixed precision scheme is introduced. Using two different device types, a high electron mobility and a band-to-band tunneling transistor, sustained performances up to 1.28 PFlop/s in double precision (55% of the peak performance) and 1.44 PFlop/s in mixed precision are reached on 221,400 cores on the CRAY-XT5 Jaguar at Oak Ridge National Lab.