The integration of application and system based metrics in a parallel program performance tool
PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
Dynamic control of performance monitoring on large scale parallel systems
ICS '93 Proceedings of the 7th international conference on Supercomputing
Rewriting executable files to measure program behavior
Software—Practice & Experience
Software—Practice & Experience
The Application Visualization System: A Computational Environment for Scientific Visualization
IEEE Computer Graphics and Applications
Visualizing the Performance of Parallel Programs
IEEE Software
IPS-2: The Second Generation of a Parallel Program Measurement System
IEEE Transactions on Parallel and Distributed Systems
Mapping performance data for high-level and data views of parallel program performance
ICS '96 Proceedings of the 10th international conference on Supercomputing
Waiting time analysis and performance visualization in Carnival
SPDT '96 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
An online computation of critical path profiling
SPDT '96 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Two performance tool design issues and CHITRA's solutions
SPDT '96 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Using the SimOS machine simulator to study complex computer systems
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Performance debugging shared memory parallel programs using run-time dependence analysis
SIGMETRICS '97 Proceedings of the 1997 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Shared-memory performance profiling
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Performance measurements for multithreaded programs
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
OCM—a monitoring system for interoperable tools
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
An object-based infrastructure for program monitoring and steering
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
An empirically derived framework for classifying parallel program performance tuning problems
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Searching for the sorting record: experiences in tuning NOW-Sort
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Portable profiling and tracing for parallel, scientific applications using C++
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Modeling and Evaluating Design Alternatives for an On-Line Instrumentation System: A Case Study
IEEE Transactions on Software Engineering
Critical Path Profiling of Message Passing and Shared-Memory Programs
IEEE Transactions on Parallel and Distributed Systems
Evaluation of a prototype visualization for distributed simulations
Proceedings of the 30th conference on Winter simulation
An Application-Driven Study of Parallel System Overheads and Network Bandwidth Requirements
IEEE Transactions on Parallel and Distributed Systems
Dynamic instrumentation of threaded applications
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Performance measurement of dynamically compiled Java executions
JAVA '99 Proceedings of the ACM 1999 conference on Java Grande
Supporting Scalable Performance Monitoring and Analysis of Parallel Programs
The Journal of Supercomputing
Web-based performance visualization of distributed discrete event simulation
Proceedings of the 31st conference on Winter simulation: Simulation---a bridge to the future - Volume 2
Running EveryWare on the computational grid
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Improving online performance diagnosis by the use of historical performance data
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Managing performance analysis with dynamic statistical projection pursuit
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Proceedings of the 14th international conference on Supercomputing
Safety checking of machine code
PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
Execution-driven performance analysis for distributed and parallel systems
Proceedings of the 2nd international workshop on Software and performance
Profiling a parallel language based on fine-grained communication
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Modeling, evaluation, and testing of paradyn instrumentation system
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
CTK: Configurable Object Abstractions for Multiprocessors
IEEE Transactions on Software Engineering
A Tool to Help Tune where Computation Is Performed
IEEE Transactions on Software Engineering
Writing Programs that Run EveryWare on the Computational Grid
IEEE Transactions on Parallel and Distributed Systems
Visualization of do-loop performance using AVS
Progress in computer research
A framework for performance-based program partitioning
Progress in computer research
Experiment management support for performance tuning
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Visualization of do-loop performance using AVS
Progress in computer research
A framework for performance-based program partitioning
Progress in computer research
On using SCALEA for performance analysis of distributed and parallel programs
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Modeling and detecting performance problems for distributed and parallel programs with JavaPSL
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Hardware-Assisted Characterization of NAS Benchmarks
Cluster Computing
PHOENIX: A Self Adaptable Monitoring Platform for Cluster Management
Cluster Computing
Performance Tuning Software DSM Applications using Visualisation
The Journal of Supercomputing
Parallel Programming and Performance Evaluation with the URSA Tool Family
International Journal of Parallel Programming
Performance Analysis and Tuning for a Single-Chip Multiprocessor DSP
IEEE Parallel & Distributed Technology: Systems & Technology
Trace Factory: Generating Workloads for Trace-Driven Simulation of Shared-Bus Multiprocessors
IEEE Parallel & Distributed Technology: Systems & Technology
Typestate Checking of Machine Code
ESOP '01 Proceedings of the 10th European Symposium on Programming Languages and Systems
HiPC '02 Proceedings of the 9th International Conference on High Performance Computing
The Monitoring and Steering Environment
ICCS '01 Proceedings of the International Conference on Computational Science-Part II
Fiddle: A Flexible Distributed Debugging Architecture
ICCS '01 Proceedings of the International Conference on Computational Science-Part II
Performance Optimization for Large Scale Computing: The Scalable VAMPIR Approach
ICCS '01 Proceedings of the International Conference on Computational Science-Part II
Run-Time Optimization Using Dynamic Performance Prediction
HPCN Europe 2000 Proceedings of the 8th International Conference on High-Performance Computing and Networking
Interoperability Support in Distributed On-Line Monitoring Systems
HPCN Europe 2000 Proceedings of the 8th International Conference on High-Performance Computing and Networking
On-Line Tool Support for Parallel Applications
HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
Dynamic Instrumentation and Performance Prediction of Application Execution
HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
Specification Techniques for Automatic Performance Analysis Tools
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
PDRS: A Performance Data Representation System
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
A Performance Advisor Tool for Shared-Memory Parallel Programming
LCPC '00 Proceedings of the 13th International Workshop on Languages and Compilers for Parallel Computing-Revised Papers
PaCT '999 Proceedings of the 5th International Conference on Parallel Computing Technologies
On Performance Modeling for HPF Applications with ASL
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
Formalizing OpenMP Performance Properties with ASL
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
A Preliminary Evaluation of FINESSE , a Feedback-Guided Performance Enhancement System
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Automatic Performance Analysis of MPI Applications Based on Event Traces
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
A Framework for an Interoperable Tool Environment (Research Note)
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
A Management Architecture for Measuring and Monitoring the Behavior of Digital Libraries
ECDL '98 Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries
Group-Based Performance Analysis for Multithreaded SMP Cluster Applications
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
On the Scalability of Tracing Mechanisms
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Deep Start: A Hybrid Strategy for Automated Performance Problem Searches
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
SCALEA: A Performance Analysis Tool for Distributed and Parallel Programs
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Applying Human Factors to the Design of Performance Tools
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
A Callgraph-Based Search Strategy for Automated Performance Diagnosis (Distinguished Paper)
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
µProfiler: Profiling User-Level Threads in a Shared-Memory Programming Environment
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications
WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
Performance Issues in Parallel Processing Systems
Performance Evaluation: Origins and Directions
Multilayer Online-Monitoring for Hybrid DSM Systems on Top of PC Clusters with a SMiLE
TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Instrumentation Database for Performance Analysis of Parallel Scientific Applications
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
An Infrastructure for Monitoring and Management in Computational Grids
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Visual MPI - A Knowledge-Based System for Writing Efficient MPI Applications
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A Community Databank for Performance Tracefiles
Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Enhancing the Functionality of Performance Measurement Tools for Message Passing Environments
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
The CrossGrid Performance Analysis Tool for Interactive Grid Applications
Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Performance Analysis for MPI Applications with SCALEA
Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Profiling Facility on a Metasystem
HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
An Automated Benchmarking Toolset
HPCN Europe 2000 Proceedings of the 8th International Conference on High-Performance Computing and Networking
SIGMA: a simulator infrastructure to guide memory analysis
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Monitoring data archives for grid environments
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Scalable analysis techniques for microprocessor performance counter metrics
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
PET, a software monitoring toolkit for performance analysis of parallel embedded applications
Journal of Systems Architecture: the EUROMICRO Journal
Prophesy: an infrastructure for performance analysis and modeling of parallel and grid applications
ACM SIGMETRICS Performance Evaluation Review
Dynamic compilation for energy adaptation
Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design
Interactive locality optimization on NUMA architectures
Proceedings of the 2003 ACM symposium on Software visualization
An infrastructure for adaptive dynamic optimization
Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Using thread-level speculation to simplify manual parallelization
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
A Rule-based Approach for Automatic Bottleneck Detection in Programs on Shared
HIPS '97 Proceedings of the 1997 Workshop on High-Level Programming Models and Supportive Environments (HIPS '97)
A Performance Debugger for Eliminating Excess Synchronization in Shared-Memory Parallel Programs
MASCOTS '96 Proceedings of the 4th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems
A Distributed Object-Oriented Framework for Tool Development
TOOLS '00 Proceedings of the Technology of Object-Oriented Languages and Systems (TOOLS 34'00)
Form: A Framework for Creating Views of Program Executions
ICSM '01 Proceedings of the IEEE International Conference on Software Maintenance (ICSM'01)
Monitoring and Characterization of Component-Based Systems with Global Causality Capture
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
Specification of Performance Problems in MPI Programs with ASL
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Dynamic native optimization of interpreters
Proceedings of the 2003 workshop on Interpreters, virtual machines and emulators
DISE: a programmable macro engine for customizing applications
Proceedings of the 30th annual international symposium on Computer architecture
The Jrpm system for dynamically parallelizing Java programs
Proceedings of the 30th annual international symposium on Computer architecture
Flexible performance visualization of parallel and distributed applications
Future Generation Computer Systems - Tools for program development and analysis
Performance Tuning of Distributed Applications with CoSMoS
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
Trace-Driven Debugging of Message Passing Programs
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
On-Demand Grid Application Tuning and Debugging with the NetLogger Activation Service
GRID '03 Proceedings of the 4th International Workshop on Grid Computing
Comparing Passive Network Monitoring of Grid Application Traffic with Active Probes
GRID '03 Proceedings of the 4th International Workshop on Grid Computing
Automatic performance analysis of hybrid MPI/OpenMP applications
Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Evolutions in parallel distributed and network-based processing
Different approaches to automatic performance analysis of distributed applications
Performance analysis and grid computing
Collective communication patterns on the quadrics network
Performance analysis and grid computing
Uniform resource visualization: software and services
Performance analysis and grid computing
A performance analysis tool for interactive grid applications
Performance analysis and grid computing
Dynamic instrumentation for Java using a virtual JVM
Performance analysis and grid computing
Aksum: a performance analysis tool for parallel and distributed applications
Performance analysis and grid computing
The unicore grid and its options for performance analysis
Performance analysis and grid computing
Targeted Path Profiling: Lower Overhead Path Profiling for Staged Dynamic Optimization Systems
Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Agent construction in Mobile Surveyor
ACM-SE 42 Proceedings of the 42nd annual Southeast regional conference
Cache Simulation Based on Runtime Instrumentation for OpenMP Applications
ANSS '04 Proceedings of the 37th annual symposium on Simulation
The design and implementation of FIT: a flexible instrumentation toolkit
Proceedings of the 5th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
The first framework for interoperable run-time software tools
On-line monitoring systems and computer tool interoperability
Supporting on-line distributed monitoring and debugging
On-line monitoring systems and computer tool interoperability
A framework for multi-execution performance tuning
On-line monitoring systems and computer tool interoperability
Specifying performance properties of parallel applications using compound events
On-line monitoring systems and computer tool interoperability
Vertical profiling: understanding the behavior of object-priented applications
OOPSLA '04 Proceedings of the 19th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Practical Path Profiling for Dynamic Optimizers
Proceedings of the international symposium on Code generation and optimization
Performance Tool Support for MPI-2 on Linux
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Efficient, Unified, and Scalable Performance Monitoring for Multiprocessor Operating Systems
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
MRNet: A Software-Based Multicast/Reduction Network for Scalable Tools
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Synthetic Simulation of Mesh-Based Parallel Applications Driven by Fine-Grained Profiling
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 15 - Volume 16
Visualizing structural properties of irregular parallel computations
SoftVis '05 Proceedings of the 2005 ACM symposium on Software visualization
Demand-driven structural testing with dynamic instrumentation
Proceedings of the 27th international conference on Software engineering
IMPuLSE: integrated monitoring and profiling for large-scale environments
LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Instrumentation in software dynamic translators for self-managed systems
WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
A taxonomy of grid monitoring systems
Future Generation Computer Systems
An API for Runtime Code Patching
International Journal of High Performance Computing Applications
Grid Network Monitoring in the European Datagrid Project
International Journal of High Performance Computing Applications
A Performance Analysis Tool for Interactive Applications on the Grid
International Journal of High Performance Computing Applications
DGMonitor: A Performance Monitoring Tool for Sandbox-Based Desktop Grid Platforms
The Journal of Supercomputing
Tdb: a source-level debugger for dynamically translated programs
Proceedings of the sixth international symposium on Automated analysis-driven debugging
OOPSLA '05 Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Continuous Path and Edge Profiling
Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Low overhead program monitoring and profiling
PASTE '05 Proceedings of the 6th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
Reducing dynamic compilation overhead by overlapping compilation and execution
ASP-DAC '06 Proceedings of the 2006 Asia and South Pacific Design Automation Conference
On-line automated performance diagnosis on thousands of processes
Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Scalable dynamic binary instrumentation for Blue Gene/L
ACM SIGARCH Computer Architecture News - Special issue on the 2005 workshop on binary instrumentation and application
Practical analysis of stripped binary code
ACM SIGARCH Computer Architecture News - Special issue on the 2005 workshop on binary instrumentation and application
Performance feature identification by comparative trace analysis
Future Generation Computer Systems
WAP5: black-box performance debugging for wide-area systems
Proceedings of the 15th international conference on World Wide Web
Spin Detection Hardware for Improved Management of Multithreaded Systems
IEEE Transactions on Parallel and Distributed Systems
Performance and environment monitoring for continuous program optimization
IBM Journal of Research and Development
Visualization aided performance tuning of irregular task-parallel computations
Information Visualization
Problem diagnosis in large-scale computing environments
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
MPI performance analysis tools on Blue Gene/L
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Grid harvest service: a performance system of grid computing
Journal of Parallel and Distributed Computing
Modeling master/worker applications for automatic performance tuning
Parallel Computing - Algorithmic skeletons
$P$^$3$$T+$: A performance estimator for distributed and parallel programs
Scientific Programming
Performance engineering, PSEs and the GRID
Scientific Programming
Grid environment for on-line application monitoring and performance analysis
Scientific Programming - AxGrids 2004
Making the "box" transparent: system call performance as a first-class result
ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
Dynamic instrumentation of production systems
ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
The Future of Software Performance Engineering
FOSE '07 2007 Future of Software Engineering
Techniques for Classifying Executions of Deployed Software to Support Software Engineering Tasks
IEEE Transactions on Software Engineering
Using hardware performance monitors to understand the behavior of java applications
VM'04 Proceedings of the 3rd conference on Virtual Machine Research And Technology Symposium - Volume 3
Scalability analysis of SPMD codes using expectations
Proceedings of the 21st annual international conference on Supercomputing
Interactive labs for experimental systems in education
ecs'07 Experimental computer science on Experimental computer science
AjaxScope: a platform for remotely monitoring the client-side behavior of web 2.0 applications
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
High-level application-specific performance analysis using the G-PM tool
Future Generation Computer Systems
Dynamic binary instrumentation and data aggregation on large scale systems
International Journal of Parallel Programming
A productivity centered application performance tuning framework
Proceedings of the 2nd international conference on Performance evaluation methodologies and tools
Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization
The Tracefile Testbed: a community repository for identifying and retrieving HPC performance data
International Journal of High Performance Computing and Networking
Live monitoring: using adaptive instrumentation and analysis to debug and maintain web applications
HOTOS'07 Proceedings of the 11th USENIX workshop on Hot topics in operating systems
Characterizing the I/O behavior of scientific applications on the Cray XT
PDSW '07 Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07
DARC: dynamic analysis of root causes of latency distributions
SIGMETRICS '08 Proceedings of the 2008 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Rule-based automatic software performance diagnosis and improvement
WOSP '08 Proceedings of the 7th international workshop on Software and performance
Controlled dynamic performance analysis
WOSP '08 Proceedings of the 7th international workshop on Software and performance
Knowledge support and automation for performance analysis with PerfExplorer 2.0
Scientific Programming - Large-Scale Programming Tools and Environments
Open | SpeedShop: An open source infrastructure for parallel performance analysis
Scientific Programming - Large-Scale Programming Tools and Environments
An Interactive Graphical Environment for Code Optimization
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Automatic Memory Access Analysis with Periscope
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
On-Line Performance Modeling for MPI Applications
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
A Calibration Framework for Capturing and Calibrating Software Performance Models
EPEW '08 Proceedings of the 5th European Performance Engineering Workshop on Computer Performance Engineering
Tools for scalable parallel program analysis: Vampir NG, MARMOT, and DeWiz
International Journal of Computational Science and Engineering
Examples of monitoring and program analysis activities with DeWiz
International Journal of Computational Science and Engineering
A Process Scheduling Analysis Model Based on Grid Environment
ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
A Holistic Approach towards Automated Performance Analysis and Tuning
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Measuring causal propagation of overhead of inefficiencies in parallel applications
PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
Diagnosing performance bottlenecks in emerging petascale applications
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
IBM Journal of Research and Development
A taxonomy of grid monitoring systems
Future Generation Computer Systems
Performance feature identification by comparative trace analysis
Future Generation Computer Systems
Monitoring and steering Grid applications with GRID superscalar
Future Generation Computer Systems
Interfacing computer aided parallelization and performance analysis
ICCS'03 Proceedings of the 2003 international conference on Computational science
Identification of performance characteristics from multi-view trace analysis
ICCS'03 Proceedings of the 2003 international conference on Computational science: PartIII
Monitoring and performance analysis of grid applications
ICCS'03 Proceedings of the 1st international conference on Computational science: PartI
Analyzing blocking to debug performance problems on multi-core systems
ACM SIGOPS Operating Systems Review
Monitoring MPI programs for performance characterization and management control
Proceedings of the 2010 ACM Symposium on Applied Computing
Automatic tuning in computational grids
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Automated performance analysis using ASL performance properties
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Automatic monitoring of memory hierarchies in threaded applications with AMEBA
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Balancing thread partition for efficiently exploiting speculative thread-level parallelism
APPT'07 Proceedings of the 7th international conference on Advanced parallel processing technologies
Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1
Rule-based automatic software performance diagnosis and improvement
Performance Evaluation
The Cilkview scalability analyzer
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
HieraAnalyses – a tool for hierarchical analysis of parallel programs
International Journal of High Performance Systems Architecture
A query language and runtime tool for evaluating behavior of multi-tier servers
Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Software—Practice & Experience
AjaxScope: A Platform for Remotely Monitoring the Client-Side Behavior of Web 2.0 Applications
ACM Transactions on the Web (TWEB)
Cell broadband engine processor performance optimization: tracing tools implementation and use
IBM Journal of Research and Development
Diagnosing mobile applications in the wild
Hotnets-IX Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks
Using dynamic compilation for continuing execution under reduced memory availability
Proceedings of the Conference on Design, Automation and Test in Europe
A virtual file system interface for computational grids
EUNICE'10 Proceedings of the 16th EUNICE/IFIP WG 6.6 conference on Networked services and applications: engineering, control and management
Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Automatic performance analysis of large scale simulations
Euro-Par'09 Proceedings of the 2009 international conference on Parallel processing
Using computational intelligence to identify performance bottlenecks in a computer system
PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part I
International Journal of High Performance Computing Applications
Performance instrumentation and compiler optimizations for MPI/OpenMP applications
IWOMP'05/IWOMP'06 Proceedings of the 2005 and 2006 international conference on OpenMP shared memory parallel programming
Performance analysis for teraflop computers: a distributed automatic approach
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
DeWiz - event-based debugging on the grid
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
FINESSE: a prototype feedback-guided performance enhancement system
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
An entropy-based algorithm for time-driven software instrumentation in parallel systems
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
A study of MPI performance analysis tools on blue gene/L
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Behavior-based problem localization for parallel file systems
HotDep'10 Proceedings of the Sixth international conference on Hot topics in system dependability
Ad hoc synchronization considered harmful
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
A static model for reverse engineering of software threads and their interactions
ACOS'06 Proceedings of the 5th WSEAS international conference on Applied computer science
Towards production monitoring of application progress
Proceedings of the 4th International Workshop on Software Engineering for Computational Science and Engineering
An idiom-finding tool for increasing productivity of accelerators
Proceedings of the international conference on Supercomputing
Just in time: adding value to the IO pipelines of high performance applications with JITStaging
Proceedings of the 20th international symposium on High performance distributed computing
Performance visualization for large-scale computing systems: a literature review
HCII'11 Proceedings of the 14th international conference on Human-computer interaction: design and development approaches - Volume Part I
TAUmon: scalable online performance data analysis in TAU
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Kismet: parallel speedup estimates for serial programs
Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
TAUg: runtime global performance data access using MPI
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Tracing the MPI-IO calls' disk accesses
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Jazz2: a flexible and extensible framework for structural testing in a Java VM
Proceedings of the 9th International Conference on Principles and Practice of Programming in Java
High-level application specific performance analysis using the G-PM tool
PVM/MPI'05 Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A loop-aware search strategy for automated performance analysis
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Performance analysis of shared-memory parallel applications using performance properties
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Monitoring MPI running nodes status for load balance
GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
LOMARC — lookahead matchmaking for multi-resource coscheduling
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Model-based performance diagnosis of master-worker parallel computations
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
PerWiz: a what-if prediction tool for tuning message passing programs
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Modeling pipeline applications in POETRIES
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Automatic tuning of master/worker applications
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
An efficient multi-level trace toolkit for multi-threaded applications
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Enabling coupled scientific simulations on the grid
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
An expert assistant for computer aided parallelization
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
A dynamic supervising model based on grid environment
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
An instrumentation infrastructure for grid workflow applications
ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
PARADIS: analysis of transaction-based applications in distributed environments
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
DynTG: a tool for interactive, dynamic instrumentation
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
A performance contract system in a grid enabling, component based programming environment
EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing
Finding inefficiencies in OpenMP applications automatically with periscope
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part II
Profiling-as-a-Service: adaptive scalable resource profiling for the cloud in the cloud
ICSOC'11 Proceedings of the 9th international conference on Service-Oriented Computing
Efficient experiment selection in automated software performance evaluations
EPEW'11 Proceedings of the 8th European conference on Computer Performance Engineering
Pinpointing data locality problems using data-centric analysis
CGO '11 Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization
Understanding and detecting real-world performance bugs
Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation
A methodology for transparent knowledge specification in a dynamic tuning environment
Software—Practice & Experience
Performance analysis of SCOOP programs
Journal of Systems and Software
Rule-based automatic software performance diagnosis and improvement
Performance Evaluation
TA UoverSupermon: low-overhead online parallel performance monitoring
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Practical differential profiling
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Search strategies for automatic performance analysis tools
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Detecting application load imbalance on high end massively parallel systems
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Analysis of the MPI-IO optimization levels with the PIOViz Jumpshot enhancement
PVM/MPI'07 Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A multi-level monitoring framework for stream-based coordination programs
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Compiler help for binary manipulation tools
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
Performance problem diagnostics by systematic experimentation
Proceedings of the 18th international doctoral symposium on Components and architecture
Automated root cause isolation of performance regressions during software development
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
An experiment specification language for goal-driven, automated performance evaluations
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Supporting swift reaction: automatically uncovering performance problems by systematic experiments
Proceedings of the 2013 International Conference on Software Engineering
Understanding the formation of wait states in applications with one-sided communication
Proceedings of the 20th European MPI Users' Group Meeting
Enabling comprehensive data-driven system management for large computational facilities
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A data-centric profiler for parallel programs
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Making problem diagnosiswork for large-scale, production storage systems
LISA'13 Proceedings of the 27th international conference on Large Installation System Administration
Runtime verification using policy-based approach to control information flow
International Journal of Security and Networks
Distributed debugging for mobile networks
Journal of Systems and Software
Hi-index | 4.10 |
The Paradyn performance-measurement tool uses several novel technologies to scale to long-running programs and large systems and automate much of the search for performance bottlenecks. The tool is based on a dynamic notion of performance instrumentation and measurement. Application programs are placed into execution, and performance instrumentation is inserted into the running programs and modified during execution. The performance consultant module automatically directs instrumentation placement and associates bottlenecks with specific causes and specific program parts. Paradyn controls its instrumentation overhead by monitoring the cost of its data collection and limiting its instrumentation to a user-controllable threshold. Paradyn's instrumentation can be configured to accept new operating systems, hardware, and application-specific performance data. Paradyn also provides an open interface for performance visualization and a simple programming library to interface visualizations. Paradyn can gather and present performance data in terms of high-level parallel languages (such as data-parallel Fortran) and can measure programs on massively parallel computers, workstation clusters, and heterogeneous combinations of these systems. The article presents two examples that illustrate Paradyn's use in finding performance problems in real applications.