Loop fusion for memory space optimization

Authors:
Antoine Fraboulet; Karen Kodary; Anne Mignotte
Affiliations:
Institut National des Sciences Appliquées de Lyon, Villeurbanne, France (all authors)
Venue:
Proceedings of the 14th international symposium on Systems synthesis
Year:
2001

Abstract

Portable and embedded systems, together with submicron technologies, have made power consumption a crucial design criterion. Memory is known to consume a great deal of power, and multimedia applications are memory-intensive. We therefore propose new techniques to optimize a behavioral description of a multimedia application before hardware/software partitioning (codesign). These transformations are applied to the "for" loops that make up the main array-handling parts of multimedia code. This paper presents an optimal algorithm that reduces the use of temporary arrays through loop fusion. Although the algorithm does not run in polynomial time, experiments have shown it to be very efficient.
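
As a rough illustration of the kind of transformation the abstract refers to, the sketch below fuses two loops so that a temporary array shrinks to a pair of scalars. It is not the paper's algorithm, which decides which loops to fuse. The function names, the squaring and summing computation, and the input data are all hypothetical, chosen only to show a consumer that reads the current and previous elements of a temporary.

```python
def unfused(a):
    """Two separate loops: the temporary array tmp needs len(a) cells."""
    n = len(a)
    tmp = [0] * n                      # temporary array, n cells of storage
    for i in range(n):                 # producer loop
        tmp[i] = a[i] * a[i]
    out = [0] * n
    for i in range(n):                 # consumer loop reads tmp[i] and tmp[i-1]
        out[i] = tmp[i] + (tmp[i - 1] if i > 0 else 0)
    return out


def fused(a):
    """One fused loop: tmp is contracted to two scalars (cur, prev)."""
    n = len(a)
    out = [0] * n
    prev = 0                           # stands in for tmp[i-1]
    for i in range(n):
        cur = a[i] * a[i]              # stands in for tmp[i]
        out[i] = cur + prev
        prev = cur                     # carry the value to the next iteration
    return out


if __name__ == "__main__":
    data = [1, 2, 3, 4]
    print(unfused(data))
    print(fused(data))
```

Both versions compute the same result, but the fused one stores two scalars instead of an array whose size grows with the input. Fusion is legal here because each consumer iteration depends only on values produced in the same or the previous iteration.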