Loop Transformations for Restructuring Compilers: The Foundations

Authors:
Utpal K. Banerjee
Affiliations:
-
Venue:
Loop Transformations for Restructuring Compilers: The Foundations
Year:
1993

Citing 0
Cited 100

Advanced compiler optimizations for sparse computations

Proceedings of the 1993 ACM/IEEE conference on Supercomputing
A general data dependence test for dynamic, pointer-based data structures

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Nonzero structure analysis

ICS '94 Proceedings of the 8th international conference on Supercomputing
Minimization of memory traffic in high-level synthesis

DAC '94 Proceedings of the 31st annual Design Automation Conference
Avoiding conditional branches by code replication

PLDI '95 Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
Symbolic array dataflow analysis for array privatization and program parallelization

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
System level verification of video and image processing specifications

ISSS '95 Proceedings of the 8th international symposium on System synthesis
Advanced compilation techniques in the PARADIGM compiler for distributed-memory multicomputers

ICS '95 Proceedings of the 9th international conference on Supercomputing
Automatic Data Structure Selection and Transformation for Sparse Matrix Computations

IEEE Transactions on Parallel and Distributed Systems
Parallelizing compilers

ACM Computing Surveys (CSUR)
Data-localization for Fortran macro-dataflow computation using partial static task assignment

ICS '96 Proceedings of the 10th international conference on Supercomputing
Cache miss equations: an analytical representation of cache misses

ICS '97 Proceedings of the 11th international conference on Supercomputing
Maximizing parallelism and minimizing synchronization with affine transforms

Proceedings of the 24th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Resource sharing in hierarchical synthesis

ICCAD '97 Proceedings of the 1997 IEEE/ACM international conference on Computer-aided design
The Static Parallelization of Loops and Recursions

The Journal of Supercomputing - Special issue: high performance computing systems
The automatic generation of sparse primitives

ACM Transactions on Mathematical Software (TOMS)
System-Level Data-Flow Transformation Exploration andPower-Area Trade-offs Demonstrated on Video Codecs

Journal of VLSI Signal Processing Systems - Special issue on systematic trade-off analysis in signal processing systems design
Precise miss analysis for program transformations with caches of arbitrary associativity

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Constraint-based array dependence analysis

ACM Transactions on Programming Languages and Systems (TOPLAS)
New shape analysis techniques for automatic parallelization of C codes

ICS '99 Proceedings of the 13th international conference on Supercomputing
An affine partitioning algorithm to maximize parallelism and minimize communication

ICS '99 Proceedings of the 13th international conference on Supercomputing
Nonlinear array layouts for hierarchical memory systems

ICS '99 Proceedings of the 13th international conference on Supercomputing
Cache miss equations: a compiler framework for analyzing and tuning memory behavior

ACM Transactions on Programming Languages and Systems (TOPLAS)
Compiler and Run-Time Support for Exploiting Regularity within Irregular Applications

IEEE Transactions on Parallel and Distributed Systems
Automated cache optimizations using CME driven diagnosis

Proceedings of the 14th international conference on Supercomputing
Generation of Efficient Nested Loops from Polyhedra

International Journal of Parallel Programming - Special issue on instruction-level parallelism and parallelizing compilation, part 2
Optimizing memory usage in the polyhedral model

ACM Transactions on Programming Languages and Systems (TOPLAS)
The Efficient Computation of Ownership Sets in HPF

IEEE Transactions on Parallel and Distributed Systems
Code generation for embedded processors

ISSS '00 Proceedings of the 13th international symposium on System synthesis
Automatic Compilation of Loops to Exploit Operator Parallelism on Configurable Arithmetic Logic Units

IEEE Transactions on Parallel and Distributed Systems
Integrating loop and data transformations for global optimization

Journal of Parallel and Distributed Computing
Precise Data Locality Optimization of Nested Loops

The Journal of Supercomputing
Enabling unimodular transformations

Proceedings of the 1994 ACM/IEEE conference on Supercomputing
New Shape Analysis and Interprocedural Techniques for Automatic Parallelization of C Codes

International Journal of Parallel Programming
NaraView: An Interactive 3D Visualization System for Parallelization of Programs

International Journal of Parallel Programming
Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors

IEEE Micro
Loop Restructuring for Data I/O Minimization on Limited On-Chip Memory Embedded Processors

IEEE Transactions on Computers
New shape analysis and interprocedural techniques for automatic parallelization of C codes

International Journal of Parallel Programming
Profiling Dependence Vectors for Loop Parallelization

IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Exploiting Ownership Sets in HPF

LCPC '00 Proceedings of the 13th International Workshop on Languages and Compilers for Parallel Computing-Revised Papers
Dynamic Memory Oriented Transformations in the MPEG4 IM1-Player on a Low Power Platform

PACS '00 Proceedings of the First International Workshop on Power-Aware Computer Systems-Revised Papers
Interprocedural Transformations for Extracting Maximum Parallelism

ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
Data Sequence Locality: A Generalization of Temporal Locality

Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Efficient Dependence Analysis for Java Arrays

Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
A Neural Network Based Tool for Semi-automatic Code Transformation

VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
Sparse Jacobian Computation in Automatic Differentiation by Static Program Analysis

SAS '98 Proceedings of the 5th International Symposium on Static Analysis
A Framework for Loop Distribution on Limited On-Chip Memory Processors

CC '00 Proceedings of the 9th International Conference on Compiler Construction
On the parallelization of loop nests containing while loops

PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
Partitioning Loops with Variable Dependence Distances

ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Static analysis of parameterized loop nests for energy efficient use of data caches

Compilers and operating systems for low power
A fast and accurate framework to analyze and optimize cache memory behavior

ACM Transactions on Programming Languages and Systems (TOPLAS)
Linear data distribution based on index analysis

High performance scientific and engineering computing
Single-Dimension Software Pipelining for Multi-Dimensional Loops

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Optimizing array reference checking in Java programs

IBM Systems Journal
Line Size Adaptivity Analysis of Parameterized Loop Nests for Direct Mapped Data Cache

IEEE Transactions on Computers
A Polynomial-Time Dependence Test for Determining Integer-Valued Solutions in Multi-Dimensional Arrays Under Variable Bounds

The Journal of Supercomputing
Exploitation of parallelism to nested loops with dependence cycles

Journal of Systems Architecture: the EUROMICRO Journal
An efficient way to filter out data dependences with a sufficiently large distance between memory references

ACM SIGPLAN Notices
A novel approach for partitioning iteration spaces with variable densities

Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
Hierarchical memory size estimation for loop fusion and loop shifting in data-dominated applications

ASP-DAC '06 Proceedings of the 2006 Asia and South Pacific Design Automation Conference
A New Approach to Parallelization of Serial Nested Loops Using Genetic Algorithms

The Journal of Supercomputing
A general approach for partitioning N-dimensional parallel nested loops with conditionals

Proceedings of the eighteenth annual ACM symposium on Parallelism in algorithms and architectures
An algebraic array shape inference system for MATLAB®

ACM Transactions on Programming Languages and Systems (TOPLAS)
Single-dimension software pipelining for multidimensional loops

ACM Transactions on Architecture and Code Optimization (TACO)
Reducing off-chip memory access via stream-conscious tiling on multimedia applications

International Journal of Parallel Programming
A scalable embedded JPEG 2000 architecture

Journal of Systems Architecture: the EUROMICRO Journal
Incremental hierarchical memory size estimation for steering of loop transformations

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Query responsive awareness software: inventory control case study

Proceedings of the 2nd international conference on Ubiquitous information management and communication
One-dimensional I test and direction vector I test with array references by induction variable

International Journal of High Performance Computing and Networking
A multi-dimensional Interval Reduction test

International Journal of High Performance Computing and Networking
Composition of Loop Modules in the Structural Blanks Approach to Programming with Recurrences: A Task of Synthesis of Nested Loops

Informatica
On the exploitation of loop-level parallelism in embedded applications

ACM Transactions on Embedded Computing Systems (TECS)
Transformations techniques for extracting parallelism in non-uniform nested loops

WSEAS Transactions on Computers
MEMMU: Memory expansion for MMU-less embedded systems

ACM Transactions on Embedded Computing Systems (TECS)
Affine and unimodular transformations for non-uniform nested loops

ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
Harnessing a Refinement Theory to Compute Loop Functions

Electronic Notes in Theoretical Computer Science (ENTCS)
Mathematics for reasoning about loop functions

Science of Computer Programming
Modern development methods and tools for embedded reconfigurable systems: A survey

Integration, the VLSI Journal
A program auto-parallelizer based on the component technology of optimizing compiler construction

Programming and Computing Software
Parallel loop generation and scheduling

The Journal of Supercomputing
Modeling and exploiting spatial locality trade-offs in wavelet-based applications under varying resource requirements

ACM Transactions on Embedded Computing Systems (TECS)
The Fortran parallel transformer and its programming environment

Information Sciences: an International Journal
Loop parallelization in multi-dimensional cartesian space

PSI'06 Proceedings of the 6th international Andrei Ershov memorial conference on Perspectives of systems informatics
A meta-heuristic approach to parallel code generation

VECPAR'02 Proceedings of the 5th international conference on High performance computing for computational science
Automatic program parallelization for multicore processors

PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I
Automatic code generation for distributed memory architectures in the polytope model

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
McFLAT: a profile-based framework for MATLAB loop analysis and transformations

LCPC'10 Proceedings of the 23rd international conference on Languages and compilers for parallel computing
Induction variable analysis with delayed abstractions

HiPEAC'05 Proceedings of the First international conference on High Performance Embedded Architectures and Compilers
A geometric approach for partitioning n-dimensional non-rectangular iteration spaces

LCPC'04 Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing
A study of performance scalability by parallelizing loop iterations on multi-core SMPs

ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Optimizing SDRAM bandwidth for custom FPGA loop accelerators

Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays
Impact of array data flow analysis on the design of energy-efficient circuits

PATMOS'06 Proceedings of the 16th international conference on Integrated Circuit and System Design: power and Timing Modeling, Optimization and Simulation
Analysis of pure methods using garbage collection

Proceedings of the 2012 ACM SIGPLAN Workshop on Memory Systems Performance and Correctness
VMAD: an advanced dynamic program analysis and instrumentation framework

CC'12 Proceedings of the 21st international conference on Compiler Construction
Invariant relations, invariant functions, and loop functions

Innovations in Systems and Software Engineering
Optimizing chip multiprocessor work distribution using dynamic compilation

Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Sub-polyhedral scheduling using (unit-)two-variable-per-inequality polyhedra

POPL '13 Proceedings of the 40th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Fast condensation of the program dependence graph

Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation
Online dynamic dependence analysis for speculative polyhedral parallelization

Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Fix the code. Don't tweak the hardware: A new compiler approach to Voltage-Frequency scaling

Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization

Quantified Score

Hi-index	0.01

Loop Transformations for Restructuring Compilers: The Foundations

Quantified Score

Visualization

Abstract