Comparing Hardware Accelerators in Scientific Applications: A Case Study

  • Authors:
  • Rick Weber;Akila Gothandaraman;Robert J. Hinde;Gregory D. Peterson

  • Affiliations:
  • University of Tennessee, Knoxville;University of Pittsburgh, Pittsburgh;University of Tennessee, Knoxville;University of Tennessee, Knoxville

  • Venue:
  • IEEE Transactions on Parallel and Distributed Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Multicore processors and a variety of accelerators have allowed scientific applications to scale to larger problem sizes. We present a performance, design methodology, platform, and architectural comparison of several application accelerators executing a Quantum Monte Carlo application. We compare the application's performance and programmability on a variety of platforms including CUDA with Nvidia GPUs, Brook+ with ATI graphics accelerators, OpenCL running on both multicore and graphics processors, C++ running on multicore processors, and a VHDL implementation running on a Xilinx FPGA. We show that OpenCL provides application portability between multicore processors and GPUs, but may incur a performance cost. Furthermore, we illustrate that graphics accelerators can make simulations involving large numbers of particles feasible.