Lithe: enabling efficient composition of parallel libraries

Authors: Heidi Pan (Massachusetts Institute of Technology); Benjamin Hindman (University of California, Berkeley); Krste Asanović (University of California, Berkeley)
Venue: HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Year: 2009

Abstract

For the software industry to take advantage of multicore processors, we must allow programmers to arbitrarily compose parallel libraries without sacrificing performance. We argue that high-level task or thread abstractions and a common global scheduler cannot provide effective library composition. Instead, the operating system should expose unvirtualized processing resources that can be shared cooperatively between parallel libraries within an application. In this paper, we describe a system that standardizes and facilitates the exchange of these unvirtualized processing resources between libraries.
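
The cooperative sharing described in the abstract can be illustrated with a small sketch. It is not the interface from the paper: the `Scheduler` class, its `grant` and `yield_all` methods, and the integer core IDs are all hypothetical names chosen for illustration. The sketch only models the ownership idea, assuming each processing resource is held by exactly one library scheduler at a time and is handed over explicitly rather than multiplexed by a global scheduler.

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class Scheduler:
    """Hypothetical per-library scheduler that owns a set of core IDs."""
    name: str
    cores: Set[int] = field(default_factory=set)
    parent: Optional["Scheduler"] = None

    def grant(self, child: "Scheduler", wanted: int) -> int:
        """Hand up to `wanted` cores to `child`, keeping one core for ourselves."""
        spare = sorted(self.cores)[1:]      # the lowest-numbered core stays here
        granted = set(spare[:wanted])
        self.cores -= granted               # ownership moves, it is never shared
        child.cores |= granted
        child.parent = self
        return len(granted)

    def yield_all(self) -> None:
        """Return every core we hold to the scheduler that granted them."""
        if self.parent is None:
            return
        self.parent.cores |= self.cores
        self.cores = set()


# An application scheduler starts with four cores and calls into a library.
app = Scheduler("application", cores={0, 1, 2, 3})
lib = Scheduler("parallel library")

app.grant(lib, wanted=2)    # the library now runs on two cores
lib.yield_all()             # the library finishes and gives them back
```

Because a core moves from one scheduler to another instead of being shared, the total number of cores in use never exceeds what the application was given, which is the property that avoids oversubscription when libraries are nested.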