GoldRush: resource efficient in situ scientific data analytics using fine-grained interference aware execution

Authors:
Fang Zheng;Hongfeng Yu;Can Hantas;Matthew Wolf;Greg Eisenhauer;Karsten Schwan;Hasan Abbasi;Scott Klasky
Affiliations:
Georgia Institute of Technology;University of Nebraska Lincoln;Georgia Institute of Technology;Georgia Institute of Technology and Oak Ridge National Laboratory;Georgia Institute of Technology;Georgia Institute of Technology;Oak Ridge National Laboratory;Oak Ridge National Laboratory
Venue:
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Year:
2013

Citing 28
Cited 0

Fast parallel algorithms for short-range molecular dynamics

Journal of Computational Physics
Linger Longer: fine-grain cycle stealing for networks of workstations

SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Grid -Based Parallel Data Streaming implemented for the Gyrokinetic Toroidal Code

Proceedings of the 2003 ACM/IEEE conference on Supercomputing
An Integrated Exploration Approach to Visualizing Multivariate Particle Data

Computing in Science and Engineering
Massively parallel volume rendering using 2-3 swap image compositing

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
High performance multivariate visual data exploration for extremely large data

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Adaptable, metadata rich IO methods for portable high performance IO

IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
Soft-OLP: Improving Hardware Cache Performance through Software-Controlled Object-Level Partitioning

PACT '09 Proceedings of the 2009 18th International Conference on Parallel Architectures and Compilation Techniques
Scalable computation of streamlines on very large datasets

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Addressing shared resource contention in multicore processors via scheduling

Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems
Contention aware execution: online contention detection and response

Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization
In Situ Visualization for Large-Scale Combustion Simulations

IEEE Computer Graphics and Applications
DataSpaces: an interaction and coordination framework for coupled simulation workflows

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Characterizing the Influence of System Noise on Large-Scale Applications by Simulation

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Reducing Cache Pollution Through Detection and Elimination of Non-Temporal Memory Accesses

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
SRC: Damaris - using dedicated i/o cores for scalable post-petascale HPC simulations

Proceedings of the international conference on Supercomputing
Just in time: adding value to the IO pipelines of high performance applications with JITStaging

Proceedings of the 20th international symposium on High performance distributed computing
Anywhere, any-time binary instrumentation

Proceedings of the 10th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools
Compressing the incompressible with ISABELA: in-situ reduction of spatio-temporal data

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
A Study of Parallel Particle Tracing for Steady-State and Time-Varying Flow Fields

IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Region scheduling: efficiently using the cache architectures via page-level affinity

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Compiling for niceness: mitigating contention for QoS in warehouse scale computers

Proceedings of the Tenth International Symposium on Code Generation and Optimization
Enabling In-situ Execution of Coupled Scientific Workflow on Multi-core Platform

IPDPS '12 Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium
Combining in-situ and in-transit processing to enable extreme-scale scientific analysis

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
ReQoS: reactive static/dynamic compilation for QoS in warehouse scale computers

Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems
FlexIO: I/O Middleware for Location-Flexible Scientific Data Analytics

IPDPS '13 Proceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Severe I/O bottlenecks on High End Computing platforms call for running data analytics in situ. Demonstrating that there exist considerable resources in compute nodes un-used by typical high end scientific simulations, we leverage this fact by creating an agile runtime, termed GoldRush, that can harvest those otherwise wasted, idle resources to efficiently run in situ data analytics. GoldRush uses fine-grained scheduling to "steal" idle resources, in ways that minimize interference between the simulation and in situ analytics. This involves recognizing the potential causes of on-node resource contention and then using scheduling methods that prevent them. Experiments with representative science applications at large scales show that resources harvested on compute nodes can be leveraged to perform useful analytics, significantly improving resource efficiency, reducing data movement costs incurred by alternate solutions, and posing negligible impact on scientific simulations.