The Cetus Source-to-Source Compiler Infrastructure: Overview and Evaluation

Authors:
Hansang Bae;Dheya Mustafa;Jae-Woo Lee;Aurangzeb;Hao Lin;Chirag Dave;Rudolf Eigenmann;Samuel P. Midkiff
Affiliations:
Purdue University, West Lafayette, USA 47907;Purdue University, West Lafayette, USA 47907;Purdue University, West Lafayette, USA 47907;Purdue University, West Lafayette, USA 47907;Purdue University, West Lafayette, USA 47907;Qualcomm, San Diego, USA 92121;Purdue University, West Lafayette, USA 47907;Purdue University, West Lafayette, USA 47907
Venue:
International Journal of Parallel Programming
Year:
2013

Abstract

This paper provides an overview and an evaluation of the Cetus source-to-source compiler infrastructure. The original goal of the Cetus project was to create an easy-to-use compiler for research in automatic parallelization of C programs. In the meantime, Cetus has been used for many additional program transformation tasks, and it serves as a compiler infrastructure for many projects in the US and internationally. Recently, Cetus has been supported by the National Science Foundation to build a community resource. The compiler has gone through several iterations of benchmark studies, each followed by the implementation of those techniques that could improve the parallel performance of the benchmark programs. These efforts have resulted in a system that compares favorably with state-of-the-art parallelizers, such as Intel's ICC. A key limitation of advanced optimizing compilers is their lack of runtime information, such as the program input data. We discuss and evaluate several techniques that support dynamic optimization decisions. Finally, because there is an extensive body of proposed compiler analyses and transformations for parallelization, the question arises of how important each technique is. This paper evaluates the impact of the individual Cetus techniques on overall program performance.