SCIRun: a scientific programming environment for computational steering
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
The grid: blueprint for a new computing infrastructure
The grid: blueprint for a new computing infrastructure
Querying very large multi-dimensional datasets in ADR
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Automatically tuned linear algebra software
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
The Autopilot performance-directed adaptive control system
Future Generation Computer Systems - I. High Performance Numerical Methods and Applications. II. Performance Data Mining: Automated Diagnosis, Adaption, and Optimization
Convergence Properties of the Nelder--Mead Simplex Method in Low Dimensions
SIAM Journal on Optimization
Falcon: on-line monitoring and steering of large-scale parallel programs
FRONTIERS '95 Proceedings of the Fifth Symposium on the Frontiers of Massively Parallel Computation (Frontiers'95)
Scheduling From the Perspective of the Application
HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
Forecasting network performance to support dynamic scheduling using the network weather service
HPDC '97 Proceedings of the 6th IEEE International Symposium on High Performance Distributed Computing
Autopilot: Adaptive Control of Distributed Applications
HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
Prediction and Adaptation in Active Harmony
HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
Exposing Application Alternatives
ICDCS '99 Proceedings of the 19th IEEE International Conference on Distributed Computing Systems
A universal algorithm for sequential data compression
IEEE Transactions on Information Theory
Using Information from Prior Runs to Improve Automated Tuning Systems
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Synthetic Simulation of Mesh-Based Parallel Applications Driven by Fine-Grained Profiling
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 15 - Volume 16
Statistical Models for Empirical Search-Based Performance Tuning
International Journal of High Performance Computing Applications
PARSE: A Tool for Parallel Application Run Time Sensitivity Evaluation
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
MPI performance analysis tools on Blue Gene/L
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Model-based system reconfiguration for dynamic performance management
Journal of Systems and Software
A component infrastructure for performance and power modeling of parallel scientific applications
Proceedings of the 2008 compFrame/HPC-GECO workshop on Component based high performance
Reducing search space of auto-tuners using parallel patterns
IWMSE '09 Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering
Adaptive Application Composition in Quantum Chemistry
QoSA '09 Proceedings of the 5th International Conference on the Quality of Software Architectures: Architectures for Adaptive Software Systems
Tuning parallel applications in parallel
Parallel Computing
Atune-IL: An Instrumentation Language for Auto-tuning Parallel Applications
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
An automated component-based performance experiment environment
Proceedings of the 2009 Workshop on Component-Based High Performance Computing
Scalable dynamic Monitoring, Analysis and Tuning Environment for parallel applications
Journal of Parallel and Distributed Computing
Engineering parallel applications with tunable architectures
Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1
Black-box performance control for high-volume non-interactive systems
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion
PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
A language-based tuning mechanism for task and pipeline parallelism
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
Dynamic knobs for responsive power-aware computing
Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
AARTS: low overhead online adaptive auto-tuning
Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era
Auto-tuning full applications: A case study
International Journal of High Performance Computing Applications
Managing performance vs. accuracy trade-offs with loop perforation
Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
TAUmon: scalable online performance data analysis in TAU
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Asynchronous invocation of adaptations in electronic structure calculations
Proceedings of the 19th High Performance Computing Symposia
TAUg: runtime global performance data access using MPI
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Self-optimization of MPI applications within an autonomic framework
HPCC'06 Proceedings of the Second international conference on High Performance Computing and Communications
Automatic configuration with conflets
EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
Automatic tuning of master/worker applications
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
MATE: toward scalable automated and dynamic performance tuning environment
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
Language and compiler support for auto-tuning variable-accuracy algorithms
CGO '11 Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization
Analytical bounds for optimal tile size selection
CC'12 Proceedings of the 21st international conference on Compiler Construction
Siblingrivalry: online autotuning through local competitions
Proceedings of the 2012 international conference on Compilers, architectures and synthesis for embedded systems
Portable section-level tuning of compiler parallelized applications
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A multi-objective auto-tuning framework for parallel codes
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
TA UoverSupermon: low-overhead online parallel performance monitoring
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Profiling of task-based applications on shared memory machines: scalability and bottlenecks
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Towards making autotuning mainstream
International Journal of High Performance Computing Applications
Towards fully automatic auto-tuning: Leveraging language features of Chapel
International Journal of High Performance Computing Applications
Designing and auto-tuning parallel 3-D FFT for computation-communication overlap
Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming
ACM Transactions on Architecture and Code Optimization (TACO)
Hi-index | 0.00 |
In this paper, we present the Active Harmony automated runtime tuning system. We describe the interface used by programs to make applications tunable. We present the Library Specification Layer which helps program library developers expose multiple variations of the same API using different algorithms. The Library Specification Language helps to select the most appropriate program library to tune the overall performance. We also present the optimization algorithm used to adjust parameters in the application and the libraries. Finally, we present results that show how the system is able to tune several real applications. The automated tuning system is able to tune the application parameters to within a few percent of the best value after evaluating only 11 out of over 1,700 possible configurations.