Turbine: A Distributed-memory Dataflow Engine for High Performance Many-task Applications

Authors:
Justin M. Wozniak;Timothy G. Armstrong;Ketan Maheshwari;Ewing L. Lusk;Daniel S. Katz;Michael Wilde;Ian T. Foster
Affiliations:
Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL USA. wozniak@mcs.anl.gov;Computer Science Department, University of Chicago, Chicago, IL USA. tga@uchicago.edu;Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL USA. ketan@mcs.anl.gov;Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL USA. lusk@mcs.anl.gov;Computation Institute, University of Chicago & Argonne National Laboratory, Chicago, IL USA. d.katz@ieee.org;Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL USA. wilde@mcs.anl.gov;Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL USA. foster@mcs.anl.gov
Venue:
Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
Year:
2013

Citing 32
Cited 0

Linda and Friends

Computer
Practical Programming in Tcl & Tk

Practical Programming in Tcl & Tk
Distributed caching with memcached

Linux Journal
Comet: A Scalable Coordination Space for Decentralized Distributed Environments

HOT-P2P '05 Proceedings of the Second International Workshop on Hot Topics in Peer-to-Peer Systems
What makes workflows work in an opportunistic environment?: Research Articles

Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
Interpreting the data: Parallel analysis with Sawzall

Scientific Programming - Dynamic Grids and Worldwide Computing
SWIG: an easy to use tool for integrating scripting languages with C and C++

TCLTK'96 Proceedings of the 4th conference on USENIX Tcl/Tk Workshop, 1996 - Volume 4
Adaptive and reliable parallel computing on networks of workstations

ATEC '97 Proceedings of the annual conference on USENIX Annual Technical Conference
Dryad: distributed data-parallel programs from sequential building blocks

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Dynamo: amazon's highly available key-value store

Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
MapReduce: simplified data processing on large clusters

Communications of the ACM - 50th anniversary issue: 1958 - 2008
Pig latin: a not-so-foreign language for data processing

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
An efficient format for nearly constant-time access to arbitrary time intervals in large trace files

Scientific Programming - Large-Scale Programming Tools and Environments
Toward loosely coupled programming on petascale systems

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Scioto: A Framework for Global-View Task Parallelism

ICPP '08 Proceedings of the 2008 37th International Conference on Parallel Processing
Composing and executing parallel data-flow graphs with shell pipes

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
Parallel Scripting for Applications at the Petascale and Beyond

Computer
Hive: a warehousing solution over a map-reduce framework

Proceedings of the VLDB Endowment
Cassandra: a decentralized structured storage system

ACM SIGOPS Operating Systems Review
Harnessing parallelism in multicore clusters with the All-Pairs, Wavefront, and Makeflow abstractions

Cluster Computing
Twister: a runtime for iterative MapReduce

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
DryadLINQ: a system for general-purpose distributed data-parallel computing using a high-level language

OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Structured parallel programming with deterministic patterns

HotPar'10 Proceedings of the 2nd USENIX conference on Hot topics in parallelism
Scripting the cloud with skywriting

HotCloud'10 Proceedings of the 2nd USENIX conference on Hot topics in cloud computing
Exascale computing technology challenges

VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
CIEL: a universal execution engine for distributed data-flow computing

Proceedings of the 8th USENIX conference on Networked systems design and implementation
DAGuE: A Generic Distributed DAG Engine for High Performance Computing

IPDPSW '11 Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum
Scalable stochastic optimization of complex energy systems

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Coasters: Uniform Resource Provisioning and Access for Clouds and Grids

UCC '11 Proceedings of the 2011 Fourth IEEE International Conference on Utility and Cloud Computing
Swift: A language for distributed parallel scripting

Parallel Computing
A preconditioning technique for Schur complement systems arising in stochastic optimization

Computational Optimization and Applications
The parallel solution of dense saddle-point linear systems arising in stochastic programming

Optimization Methods & Software - Special issue in honour of Professor Florian A. Potra's 60th birthday

Quantified Score

Hi-index	0.00

Visualization

Abstract

Efficiently utilizing the rapidly increasing concurrency of multi-petaflop computing systems is a significant programming challenge. One approach is to structure applications with an upper layer of many loosely coupled coarse-grained tasks, each comprising a tightly-coupled parallel function or program. “Many-task” programming models such as functional parallel dataflow may be used at the upper layer to generate massive numbers of tasks, each of which generates significant tightly coupled parallelism at the lower level through multithreading, message passing, and/or partitioned global address spaces. At large scales, however, the management of task distribution, data dependencies, and intertask data movement is a significant performance challenge. In this work, we describe Turbine, a new highly scalable and distributed many-task dataflow engine. Turbine executes a generalized many-task intermediate representation with automated self-distribution and is scalable to multi-petaflop infrastructures. We present here the architecture of Turbine and its performance on highly concurrent systems.