Efficiently speeding up sequential computation through the n-way programming model

Authors:
Romain E. Cledat;Tushar Kumar;Santosh Pande
Affiliations:
Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA
Venue:
Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
Year:
2011

Citing 20
Cited 1

Randomized algorithms

Randomized algorithms
The cilk system for parallel multithreaded computing

The cilk system for parallel multithreaded computing
Approximation algorithms

Approximation algorithms
Optimal Parallelization of Las Vegas Algorithms

STACS '94 Proceedings of the 11th Annual Symposium on Theoretical Aspects of Computer Science
Language support for lightweight transactions

OOPSLA '03 Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications
Probabilistic accuracy bounds for fault-tolerant computations that discard tasks

Proceedings of the 20th annual international conference on Supercomputing
Versioned boxes as the basis for memory transactions

Science of Computer Programming - Special issue: Synchronization and concurrency in object-oriented languages
Optimistic parallelism requires abstractions

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
N-variant systems: a secretless framework for security through diversity

USENIX-SS'06 Proceedings of the 15th conference on USENIX Security Symposium - Volume 15
Amdahl's Law in the Multicore Era

Computer
Orchestra: intrusion detection using parallel execution and monitoring of program variants in user-space

Proceedings of the 4th ACM European conference on Computer systems
A Concurrent Portfolio Approach to SMT Solving

CAV '09 Proceedings of the 21st International Conference on Computer Aided Verification
Grace: safe multithreaded programming for C/C++

Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications
Variant-based competitive parallel execution of sequential programs

Proceedings of the 7th ACM international conference on Computing frontiers
Quality of service profiling

Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1
The trouble With multi-core

IEEE Spectrum
Opportunistic computing: a new paradigm for scalable realism on many-cores

HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Ease of use with concurrent collections (CnC)

HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Error detection using BMC in a parallel environment

CHARME'05 Proceedings of the 13 IFIP WG 10.5 international conference on Correct Hardware Design and Verification Methods
Language and compiler support for auto-tuning variable-accuracy algorithms

CGO '11 Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization

Multiverse: efficiently supporting distributed high-level speculation

Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

With core counts on the rise, the sequential components of applications are becoming the major bottleneck in performance scaling as predicted by Amdahl's law. We are therefore faced with the simultaneous problems of occupying an increasing number of cores and speeding up sequential sections. In this work, we reconcile these two seemingly incompatible problems with a novel programming model called N-way. The core idea behind N-way is to benefit from the algorithmic diversity available to express certain key computational steps. By simultaneously launching in parallel multiple ways to solve a given computation, a runtime can just-in-time pick the best (for example the fastest) way and therefore achieve speedup. Previous work has demonstrated the benefits of such an approach but has not addressed its inherent waste. In this work, we focus on providing a mathematically sound learning-based statistical model that can be used by a runtime to determine the optimal balance between resources used and benefits obtainable through N-way. We further describe a dynamic culling mechanism to further reduce resource waste. We present abstractions and a runtime support to cleanly encapsulate the computational-options and monitor their progress. We demonstrate a low-overhead runtime that achieves significant speedup over a range of widely used kernels. Our results demonstrate super-linear speedups in certain cases.