Scientists, engineers, and statisticians must execute domain-specific application programs many times on large collections of file-based data. This activity requires complex orchestration and data management as data is passed to, from, and among application invocations. Distributed and parallel computing resources can accelerate such processing, but their use further increases programming complexity. The Swift parallel scripting language reduces these complexities by making file system structures accessible via language constructs and by allowing ordinary application programs to be composed into powerful parallel scripts that can efficiently utilize parallel and distributed resources. We present Swift's implicitly parallel and deterministic programming model, which applies external applications to file collections using a functional style that abstracts and simplifies distributed parallel execution.
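The model described above can be illustrated with a minimal Python sketch. This is not Swift syntax and not the authors' implementation. It only mimics the idea of applying an external program to every file in a collection, with each invocation free to run in parallel and the results collected in a deterministic order. The names `app_count_words` and `map_over_files`, the `*.txt` pattern, and the `.count` output suffix are hypothetical, and a Python one-liner stands in for the "ordinary application program".

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def app_count_words(infile: Path, outfile: Path) -> Path:
    """Hypothetical 'app' wrapper: run an external program on one input
    file, write its stdout to outfile, and return the output path."""
    # Stand-in external program (assumption): count whitespace-separated words.
    code = "import sys; print(len(open(sys.argv[1]).read().split()))"
    with outfile.open("w") as out:
        subprocess.run(
            [sys.executable, "-c", code, str(infile)],
            stdout=out,
            check=True,
        )
    return outfile


def map_over_files(indir: Path, outdir: Path, pattern: str = "*.txt") -> list[Path]:
    """Apply the app to every matching file. Invocations are independent,
    so they may run concurrently; results come back in sorted input order,
    regardless of which invocation finishes first."""
    inputs = sorted(indir.glob(pattern))
    with ThreadPoolExecutor() as pool:
        futures = [
            pool.submit(app_count_words, f, outdir / (f.stem + ".count"))
            for f in inputs
        ]
        # Waiting on the futures in submission order keeps the result
        # deterministic even though execution order is not.
        return [fut.result() for fut in futures]
```

Calling `map_over_files(Path("in"), Path("out"))` on a directory of text files would produce one `.count` file per input. The caller never writes explicit thread or scheduling code: the parallelism follows from the independence of the per-file invocations, which is the sense in which the abstract calls the model implicitly parallel.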