Extending OpenMP* with vector constructs for modern multicore SIMD architectures

  • Authors:
  • Michael Klemm;Alejandro Duran;Xinmin Tian;Hideki Saito;Diego Caballero;Xavier Martorell

  • Affiliations:
  • Intel Corporation;Barcelona Supercomputing Center, Spain;Intel Corporation;Intel Corporation;Barcelona Supercomputing Center, Spain;Intel Corporation, USA,Universitat Politecnica de Catalunya, Spain

  • Venue:
  • IWOMP'12 Proceedings of the 8th international conference on OpenMP in a Heterogeneous World
  • Year:
  • 2012

Abstract

To obtain maximum performance, many applications need to extend parallelism from multi-threading to the instruction-level (SIMD) parallelism available in many current (and future) multi-core architectures. While auto-vectorization technology has been used to exploit this SIMD level, it is not always enough because of OpenMP semantics and the limitations of compiler technology. In those cases, programmers need to resort to low-level intrinsics or vendor-specific directives. We propose a new OpenMP directive: the simd directive. This directive allows programmers to guide the vectorization process, enabling more productive and portable exploitation of the SIMD level. Our performance results show significant improvements over the current auto-vectorizing technology of the Intel® Composer XE 2011.
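
To make the idea of a programmer-guided vectorization directive concrete, here is a minimal sketch in C. It assumes the `simd` construct syntax as later standardized in OpenMP 4.0 (`#pragma omp simd` with an optional `reduction` clause), which may differ in detail from the syntax proposed in the paper. The kernels `saxpy` and `dot` are hypothetical examples and do not come from the paper.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical kernel (not from the paper): y[i] = a * x[i] + y[i].
 * Assumes x and y do not overlap. */
void saxpy(int n, float a, const float *x, float *y)
{
    assert(x != NULL && y != NULL);

    /* Asks the compiler to vectorize this loop. A compiler without
     * OpenMP SIMD support ignores the pragma and runs the loop serially. */
    #pragma omp simd
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}

/* Hypothetical kernel (not from the paper): dot product of x and y. */
float dot(int n, const float *x, const float *y)
{
    assert(x != NULL && y != NULL);

    float sum = 0.0f;

    /* The reduction clause tells the compiler that sum is accumulated
     * across iterations, so per-lane partial sums may be combined. */
    #pragma omp simd reduction(+:sum)
    for (int i = 0; i < n; ++i) {
        sum += x[i] * y[i];
    }
    return sum;
}
```

With GCC or Clang, the pragmas should take effect when compiling with `-fopenmp` or `-fopenmp-simd`; without either flag the code still compiles and produces the same results, only without the vectorization hint. Because a SIMD reduction may reorder floating-point additions, the result of `dot` can differ slightly from a strictly serial sum for inputs that are not exactly representable.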