Predicting the impact of optimizations for embedded systems

Authors:
Min Zhao;Bruce Childers;Mary Lou Soffa
Affiliations:
University of Pittsburgh;University of Pittsburgh;University of Pittsburgh
Venue:
Proceedings of the 2003 ACM SIGPLAN conference on Language, compiler, and tool for embedded systems
Year:
2003

Abstract

When applying optimizations, a number of decisions are made using fixed strategies, such as always applying an optimization whenever it is applicable, applying optimizations in a fixed order, and assuming fixed configuration parameters such as tile size and loop unrolling factor. While it is widely recognized that these fixed strategies may not be the most appropriate for producing high-quality code, especially for embedded systems, there are no general and automatic strategies that do otherwise. In this paper, we present a framework that enables these decisions to be made by predicting the impact of an optimization, taking into account resources and code context. The framework consists of optimization models, code models, and resource models, which are integrated to predict the impact of applying optimizations. Because data cache performance is important to embedded codes, we present an instance of the framework for cache performance. Since most opportunities for cache improvement come from loop optimizations, we describe code, optimization, and cache models tailored to predict the impact of applying loop optimizations for data locality. Experimentally, we demonstrate the need to apply optimizations selectively and show the performance benefit of using our framework to predict when to apply an optimization. We also show that the framework can be used to choose the most beneficial optimization when several optimizations can be applied to a loop nest. Finally, we show that the framework can be used to combine optimizations on a loop nest.
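To make the idea of prediction-driven selection concrete, the following is a minimal sketch and not the models from the paper. It assumes a toy code model (`LoopNest`, one access to a row-major 2-D array per iteration), a toy resource model (`CacheModel`, line size only), and a toy optimization model (`interchange`). All of these names, and the stride-based miss estimate in `predicted_misses`, are hypothetical and chosen only for illustration. The estimate further assumes the array is much larger than the cache, so no line survives between outer-loop iterations.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class CacheModel:
    """Hypothetical resource model: only the cache line size is modeled."""
    line_bytes: int


@dataclass(frozen=True)
class LoopNest:
    """Hypothetical code model: a 2-deep nest reading a[i][j] (row-major)."""
    rows: int
    cols: int
    elem_bytes: int
    order: tuple  # loops from outer to inner: ("i", "j") or ("j", "i")


def interchange(nest):
    """Hypothetical optimization model: swap the two loops of the nest."""
    return replace(nest, order=nest.order[::-1])


def predicted_misses(nest, cache):
    """Estimate misses from the inner-loop stride.

    Assumption: the array does not fit in the cache, so the only reuse is
    between consecutive inner-loop accesses that fall on the same line.
    """
    total = nest.rows * nest.cols
    inner = nest.order[-1]
    # Inner loop over j walks along a row; over i it jumps a whole row.
    stride = nest.elem_bytes if inner == "j" else nest.cols * nest.elem_bytes
    touched = total * min(stride, cache.line_bytes)
    return -(-touched // cache.line_bytes)  # ceiling division


def choose(nest, cache, candidates):
    """Pick the candidate with the fewest predicted misses, or "none"."""
    best_name, best_nest = "none", nest
    for name, transform in candidates.items():
        candidate = transform(nest)
        if predicted_misses(candidate, cache) < predicted_misses(best_nest, cache):
            best_name, best_nest = name, candidate
    return best_name, best_nest


if __name__ == "__main__":
    cache = CacheModel(line_bytes=32)
    column_walk = LoopNest(rows=1024, cols=1024, elem_bytes=4, order=("j", "i"))
    name, chosen = choose(column_walk, cache, {"interchange": interchange})
    print(name, predicted_misses(column_walk, cache), predicted_misses(chosen, cache))
```

The point of the sketch is the decision structure: an optimization is applied only when the combined models predict a benefit, so a nest that already walks rows in the inner loop is left untouched instead of being interchanged unconditionally.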