Parallel expression template for large vectors

  • Authors:
  • Laurent Plagne; Frank Hülsemann; Denis Barthou; Julien Jaeger

  • Affiliations:
  • EDF R&D, Clamart, France; EDF R&D, Clamart, France; University of Versailles, St Quentin / INRIA, France; University of Versailles, St Quentin, France

  • Venue:
  • Proceedings of the 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing
  • Year:
  • 2009

Abstract

This paper describes a short and simple way of improving the performance of vector operations (e.g. X = aY + bZ + ...) applied to large vectors. In a previous paper [1] we described how to take advantage of the high-performance vector copy operation provided by the ATLAS library [2] within the C++ Expression Template (ET) mechanism. Here we present a multi-threaded implementation of this approach. The proposed ET implementation, which relies on a parallel blocking technique, yields a significant performance increase over existing implementations (up to a factor of 2.7) on dual-socket x86_64 targets.
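
To make the idea concrete, the following is a minimal sketch of an expression template whose assignment is evaluated block by block on several threads. It is not the authors' implementation: the names `Expr`, `Scaled`, `Sum`, `Vec` and the block size `kBlock` are hypothetical, `std::thread` is assumed as the threading mechanism, and a plain inner loop stands in for the ATLAS copy routine mentioned in the abstract.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// CRTP base: lets the operators below accept only expression types.
template <class E>
struct Expr {
    const E& self() const { return static_cast<const E&>(*this); }
};

// Node for a * e. Nodes hold references, so an expression must be consumed
// in the statement that builds it (do not store one with `auto`).
template <class E>
class Scaled : public Expr<Scaled<E>> {
    double a_;
    const E& e_;
public:
    Scaled(double a, const E& e) : a_(a), e_(e) {}
    double operator[](std::size_t i) const { return a_ * e_[i]; }
    std::size_t size() const { return e_.size(); }
};

// Node for l + r.
template <class L, class R>
class Sum : public Expr<Sum<L, R>> {
    const L& l_;
    const R& r_;
public:
    Sum(const L& l, const R& r) : l_(l), r_(r) {}
    double operator[](std::size_t i) const { return l_[i] + r_[i]; }
    std::size_t size() const { return l_.size(); }
};

class Vec : public Expr<Vec> {
    std::vector<double> d_;
    // Hypothetical block size (in elements), chosen so that one block of
    // each operand is likely to fit in cache; not a value from the paper.
    static constexpr std::size_t kBlock = 4096;
public:
    explicit Vec(std::size_t n, double v = 0.0) : d_(n, v) {}
    std::size_t size() const { return d_.size(); }
    double operator[](std::size_t i) const { return d_[i]; }
    double& operator[](std::size_t i) { return d_[i]; }

    // Evaluate the whole expression in a single pass: each thread owns one
    // contiguous chunk of the index range and walks it block by block.
    template <class E>
    Vec& operator=(const Expr<E>& x) {
        const E& e = x.self();
        assert(e.size() == size());
        const std::size_t n = size();
        const unsigned hw = std::thread::hardware_concurrency();
        const std::size_t nt = hw ? hw : 2;  // fall back to 2 if unknown
        const std::size_t chunk = (n + nt - 1) / nt;
        double* out = d_.data();
        std::vector<std::thread> pool;
        for (std::size_t t = 0; t < nt; ++t) {
            const std::size_t lo = std::min(n, t * chunk);
            const std::size_t hi = std::min(n, lo + chunk);
            if (lo == hi) continue;  // nothing left for this thread
            pool.emplace_back([&e, out, lo, hi] {
                for (std::size_t b = lo; b < hi; b += kBlock) {
                    const std::size_t end = std::min(hi, b + kBlock);
                    for (std::size_t i = b; i < end; ++i) out[i] = e[i];
                }
            });
        }
        for (auto& th : pool) th.join();
        return *this;
    }
};

template <class E>
Scaled<E> operator*(double a, const Expr<E>& e) {
    return Scaled<E>(a, e.self());
}

template <class L, class R>
Sum<L, R> operator+(const Expr<L>& l, const Expr<R>& r) {
    return Sum<L, R>(l.self(), r.self());
}
```

With `Vec X(n), Y(n, 1.0), Z(n, 2.0);`, the statement `X = 2.0 * Y + 3.0 * Z;` builds a `Sum` of two `Scaled` nodes without allocating any temporary vector, and the assignment then fills `X` in parallel. Spawning threads on every assignment keeps the sketch short; a persistent thread pool would be the more likely choice when assignments are frequent.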