Implementing stack simulation for highly-associative memories
SIGMETRICS '91 Proceedings of the 1991 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Dynamic dependency analysis of ordinary programs
ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
Experiences using the ParaScope Editor: an interactive parallel programming tool
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
The high performance Fortran handbook
The high performance Fortran handbook
Automatic and interactive parallelization
Automatic and interactive parallelization
An integrated compilation and performance analysis environment for data parallel programs
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems
MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Smart Memories: a modular reconfigurable architecture
Proceedings of the 27th annual international symposium on Computer architecture
High Performance Compilers for Parallel Computing
High Performance Compilers for Parallel Computing
A stream compiler for communication-exposed architectures
Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
IEEE Micro
Loop-Level Parallelism in Numeric and Symbolic Programs
IEEE Transactions on Parallel and Distributed Systems
Using thread-level speculation to simplify manual parallelization
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture
Proceedings of the 30th annual international symposium on Computer architecture
Reuse Distance Analysis
Hi-index | 0.00 |
The SAPIENT parallel analysis framework facilitates the efficient transformation of sequential applications into multilevel parallel applications that can be executed on polymorphic chip multiprocessor architectures. We demonstrate how application characteristics are used to detect thread and data level parallelism in sequential applications and estimate parallel performance. We further demonstrate how SAPIENT determines the combination of application parallelism and polymorphic architecture configuration that maximizes performance. As an example, we present a detailed analysis of parallelism for an MPEG-2 decoder. We further summarize results for six other multimedia applications, identifying the presence of data and thread level parallelism, evaluating performance, and suggesting architecture configurations for each.