Nebelung: execution environment for transactional OpenMP

  • Authors:
  • Miloš Milovanović;Roger Ferrer;Vladimir Gajinov;Osman S. Unsal;Adrian Cristal;Eduard Ayguadé;Mateo Valero

  • Affiliations:
  • Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain and Department of Computer Architecture, Universitat Politecnica de Catalunya, Barcelona, Spain;Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain and Department of Computer Architecture, Universitat Politecnica de Catalunya, Barcelona, Spain;Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain and Department of Computer Architecture, Universitat Politecnica de Catalunya, Barcelona, Spain;Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain;Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain;Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain and Department of Computer Architecture, Universitat Politecnica de Catalunya, Barcelona, Spain;Barcelona Supercomputing Center (BSC-CNS), Barcelona, spain and Department of Computer Architecture, Universitat Politecnica de Catalunya, Barcelona, Spain

  • Venue:
  • International Journal of Parallel Programming
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Future generations of Chip Multiprocessors (CMP) will provide dozens or even hundreds of cores inside the chip. Writing applications that benefit from the massive computational power offered by these chips is not going to be an easy task for mainstream programmers who are used to sequential algorithms rather than parallel ones. This paper explores the possibility of using Transactional Memory (TM) in OpenMP, the industrial standard for writing parallel programs on shared-memory architectures, for C, C++ and Fortran. One of the major complexities in writing OpenMP applications is the use of critical regions (locks), atomic regions and barriers to synchronize the execution of parallel activities in threads. TM has been proposed as a mechanism that abstracts some of the complexities associated with concurrent access to shared data while enabling scalable performance. The paper presents a first proof-of-concept implementation of OpenMP with TM. Some language extensions to OpenMP are proposed to express transactions. These extensions are implemented in our source-to-source OpenMP Mercurium compiler and our Software Transactional Memory (STM) runtime system Nebelung that supports the code generated by Mercurium. Hardware Transactional Memory (HTM) or Hardware-assisted STM (HaSTM) are seen as possible paths to make the tandem TM-OpenMP more scalable. In the evaluation section we show the preliminary results. The paper finishes with a set of open issues that still need to be addressed, either in OpenMP or in the hardware/software implementations of TM.