In this work we present a predictive analytical model that captures the performance and scaling characteristics of an important ASCI application. SAGE (SAIC's Adaptive Grid Eulerian hydrocode) is a multidimensional hydrodynamics code with adaptive mesh refinement. The model is validated against measurements on several systems, including ASCI Blue Mountain, ASCI White, and a Compaq AlphaServer ES45 system, and shows high accuracy. It is parametric: basic machine performance numbers (latency, MFLOPS rate, bandwidth) and application characteristics (problem size, decomposition method, etc.) serve as input. The model is applied to give insight into the performance of current systems, to reveal bottlenecks, and to illustrate where tuning efforts can be effective. We also use the model to predict performance on future systems.
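To make the idea of a parametric model concrete, the following is a minimal sketch of how machine parameters (latency, bandwidth, MFLOPS rate) and application parameters (problem size per processor, message counts) might combine into a predicted time per cycle. It is not the SAGE model from the paper: the additive compute-plus-communication form, the class names `Machine` and `Workload`, and all fields such as `flops_per_cell` and `messages_per_cycle` are assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Machine:
    """Hypothetical machine parameters (names are illustrative only)."""
    latency_s: float       # per-message latency, in seconds
    bandwidth_bps: float   # point-to-point bandwidth, in bytes per second
    mflops: float          # achieved per-processor rate, in MFLOPS


@dataclass
class Workload:
    """Hypothetical application parameters for one processor and one cycle."""
    cells_per_pe: int          # problem size assigned to each processor
    flops_per_cell: float      # assumed floating-point work per cell
    messages_per_cycle: int    # assumed number of boundary messages
    bytes_per_message: float   # assumed size of each message


def predict_cycle_time(machine: Machine, workload: Workload) -> tuple[float, float]:
    """Return (compute_seconds, communication_seconds) for one cycle.

    Assumes computation and communication do not overlap, so the
    predicted cycle time is simply the sum of the two terms.
    """
    compute = workload.cells_per_pe * workload.flops_per_cell / (machine.mflops * 1e6)
    per_message = machine.latency_s + workload.bytes_per_message / machine.bandwidth_bps
    communication = workload.messages_per_cycle * per_message
    return compute, communication


# Example with made-up numbers, not measurements from any real system.
machine = Machine(latency_s=1e-5, bandwidth_bps=1e8, mflops=100.0)
workload = Workload(cells_per_pe=10_000, flops_per_cell=1_000.0,
                    messages_per_cycle=4, bytes_per_message=1e5)
compute, communication = predict_cycle_time(machine, workload)
total = compute + communication
print(f"compute={compute:.5f}s communication={communication:.5f}s total={total:.5f}s")
```

Separating the two terms is what lets a model of this kind point at bottlenecks: changing one machine parameter at a time shows which term dominates, and substituting the parameters of a hypothetical future system gives a prediction under the same assumptions.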