This work presents results from ongoing investigations into a performance modeling framework developed by the Performance Modeling and Characterization (PMaC) Lab at the San Diego Supercomputer Center. The framework is faster than traditional cycle-accurate simulation, more sophisticated than performance estimation based on system peak-performance metrics, and is shown to be effective on benchmarks and scientific applications. This paper focuses on one capability of the framework: sensitivity studies that clarify the observed and anticipated effects of both the architecture and the application on predicted runtime.