MCSTL: the multi-core standard template library

Authors:
Johannes Singler;Peter Sanders;Felix Putze
Affiliations:
Universität Karlsruhe;Universität Karlsruhe;Universität Karlsruhe
Venue:
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Year:
2007

Citing 14
Cited 21

DIB—a distributed implementation of backtracking

ACM Transactions on Programming Languages and Systems (TOPLAS)
Merging multiple lists on hierarchical-memory multiprocessors

Journal of Parallel and Distributed Computing - Special issue on shared-memory multiprocessors
An introduction to parallel algorithms

An introduction to parallel algorithms
Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator

ACM Transactions on Modeling and Computer Simulation (TOMACS) - Special issue on uniform random number generation
Random permutations on distributed, external and hierarchical memory

Information Processing Letters
Scheduling multithreaded computations by work stealing

Journal of the ACM (JACM)
Fast priority queues for cached memory

Journal of Experimental Algorithmics (JEA)
C++ Standard Template Library

C++ Standard Template Library
Register Efficient Mergesorting

HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Asynchronous parallel disk sorting

Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures
A framework for adaptive algorithm selection in STAPL

Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
MCSTL: the multi-core standard template library

Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
STAPL: an adaptive, generic parallel C++ library

LCPC'01 Proceedings of the 14th international conference on Languages and compilers for parallel computing
STXXL: standard template library for XXL data sets

ESA'05 Proceedings of the 13th annual European conference on Algorithms

The GNU libstdc++ parallel mode: software engineering considerations

Proceedings of the 1st international workshop on Multicore software engineering
Deque-Free Work-Optimal Parallel STL Algorithms

Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Parallel geometric algorithms for multi-core computers

Proceedings of the twenty-fifth annual symposium on Computational geometry
Hybrid and custom data structures: evolution of the data structures course

ITiCSE '09 Proceedings of the 14th annual ACM SIGCSE conference on Innovation and technology in computer science education
Combining digital access and parallel partition for quicksort and quickselect

IWMSE '09 Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering
Parallel Skeletons for Variable-Length Lists in SkeTo Skeleton Library

Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Parallel partition revisited

WEA'08 Proceedings of the 7th international conference on Experimental algorithms
Parallelization of bulk operations for STL dictionaries

Euro-Par'07 Proceedings of the 2007 conference on Parallel processing
Parallel geometric algorithms for multi-core computers

Computational Geometry: Theory and Applications
Lessons from implementing the biCGStab method with SkeTo library

Proceedings of the fourth international workshop on High-level parallel programming and applications
A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers

Software—Practice & Experience - Focus on Selected PhD Literature Reviews in the Practical Aspects of Software Technology
Implementing fusion-equipped parallel skeletons by expression templates

IFL'09 Proceedings of the 21st international conference on Implementation and application of functional languages
A generic topology library

Science of Computer Programming
Work-stealing for mixed-mode parallelism by deterministic team-building

Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
Vector class on limited local memory (LLM) multi-core processors

CASES '11 Proceedings of the 14th international conference on Compilers, architectures and synthesis for embedded systems
A comparative study of parallel sort algorithms

Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion
Multicore C++ Standard Template Library in a Generative Way

Electronic Notes in Theoretical Computer Science (ENTCS)
Internally deterministic parallel algorithms can be fast

Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Parallel and I/O efficient set covering algorithms

Proceedings of the twenty-fourth annual ACM symposium on Parallelism in algorithms and architectures
Fast and memory-efficient minimum spanning tree on the GPU

International Journal of Computational Science and Engineering
Comparison based sorting for systems with multiple GPUs

Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units

Quantified Score

Hi-index	0.00

Visualization

Abstract

Future gain in computing performance will not stem from increased clock rates, but from even more cores in a processor. Since automatic parallelization is still limited to easily parallelizable sections of the code, most applications will soon have to support parallelism explicitly. The Multi-Core Standard Template Library (MCSTL) simplifies parallelization by providing efficient parallel implementations of the algorithms in the C++ Standard Template Library. Thus, simple recompilation will provide partial parallelization of applications that make consistent use of the STL. We present performance measurements on several architectures. For example, our sorter achieves a speedup of 21 on an 8-core 32-thread SUN T1.