Middleware support for many-task computing

Authors:
Ioan Raicu;Ian Foster;Mike Wilde;Zhao Zhang;Kamil Iskra;Peter Beckman;Yong Zhao;Alex Szalay;Alok Choudhary;Philip Little;Christopher Moretti;Amitabh Chaudhary;Douglas Thain
Affiliations:
Northwestern University, Evanston, USA;University of Chicago, Chicago, USA and Argonne National Laboratory, Argonne, USA;University of Chicago, Chicago, USA and Argonne National Laboratory, Argonne, USA;University of Chicago, Chicago, USA;University of Chicago, Chicago, USA and Argonne National Laboratory, Argonne, USA;University of Chicago, Chicago, USA and Argonne National Laboratory, Argonne, USA;Microsoft, Redmond, USA;Johns Hopkins University, Baltimore, USA;Northwestern University, Evanston, USA;University of Notre Dame, Notre Dame, USA;University of Notre Dame, Notre Dame, USA;University of Notre Dame, Notre Dame, USA;University of Notre Dame, Notre Dame, USA
Venue:
Cluster Computing
Year:
2010

Abstract

Many-task computing aims to bridge the gap between two computing paradigms: high-throughput computing and high-performance computing. Many-task computing denotes high-performance computations comprising multiple distinct activities, coupled via file system operations. The aggregate number of tasks, quantity of computing, and volumes of data may be extremely large. Traditional techniques used in production systems in the scientific community to support many-task computing do not scale to today's largest systems, due to issues in local resource manager scalability and granularity, efficient utilization of the raw hardware, long wait queue times, and shared/parallel file system contention and scalability. To address these limitations, we adopted a "top-down" approach to building a middleware called Falkon, to support the most demanding many-task computing applications at the largest scales. Falkon (Fast and Light-weight tasK executiON framework) integrates (1) multi-level scheduling to enable dynamic resource provisioning and minimize wait queue times, (2) a streamlined task dispatcher able to achieve orders-of-magnitude higher task dispatch rates than conventional schedulers, and (3) data diffusion, which performs data caching and uses a data-aware scheduler to co-locate computational and storage resources. Micro-benchmarks have shown Falkon to achieve throughputs of over 15K tasks/s, scale to hundreds of thousands of processors and to millions of queued tasks, and execute billions of tasks per day. Data diffusion has also been shown to improve application scalability and performance, with its ability to achieve hundreds of Gb/s I/O rates on modest-sized clusters, with Tb/s I/O rates on the horizon.
Falkon has shown orders-of-magnitude improvements in performance and scalability over traditional approaches to resource management across many diverse workloads and applications, at scales of billions of tasks on hundreds of thousands of processors across clusters, specialized systems, Grids, and supercomputers. Falkon's performance and scalability have enabled a new class of applications, called Many-Task Computing, to operate with high efficiency at scales previously believed impossible.
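
The data-aware scheduling idea named in the abstract (send a task to a node that already caches its input) can be illustrated with a small Python sketch. This is not Falkon's implementation: the `Worker` class, the `dispatch` function, the two-slot LRU cache, and the least-loaded tie-break are all assumptions chosen for illustration.

```python
from collections import OrderedDict


class Worker:
    """A compute node with a small LRU cache of file names (hypothetical model)."""

    def __init__(self, name, cache_slots=2):
        self.name = name
        self.cache_slots = cache_slots
        self.cache = OrderedDict()  # file name -> True, least recently used first

    def run(self, file_name):
        """Process one task; return True on a cache hit, False on a miss."""
        hit = file_name in self.cache
        if hit:
            self.cache.move_to_end(file_name)  # mark as most recently used
        else:
            self.cache[file_name] = True  # stands in for a fetch from shared storage
            if len(self.cache) > self.cache_slots:
                self.cache.popitem(last=False)  # evict the least recently used file
        return hit


def dispatch(tasks, workers):
    """Prefer a worker that caches the task's file; otherwise pick the least loaded."""
    load = {w.name: 0 for w in workers}
    hits = 0
    for file_name in tasks:
        holders = [w for w in workers if file_name in w.cache]
        candidates = holders or workers  # fall back to all workers on a cold file
        target = min(candidates, key=lambda w: load[w.name])
        hits += target.run(file_name)  # True counts as 1
        load[target.name] += 1
    return hits, load


workers = [Worker("w0"), Worker("w1")]
tasks = ["a", "b", "a", "b", "a", "b"]  # each task is named by its input file
hits, load = dispatch(tasks, workers)
print(hits, load)
```

In this toy run, only the first access to each file misses; every later task lands on the worker that already holds its input, so the shared file system is touched once per file rather than once per task.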