Uniform techniques for loop optimization
ICS '91 Proceedings of the 5th international conference on Supercomputing
Software pipelining: an evaluation of enhanced pipelining
MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
Efficient DAG construction and heuristic calculation for instruction scheduling
MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
Eliminating false data dependences using the Omega test
PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
ICS '92 Proceedings of the 6th international conference on Supercomputing
Access normalization: loop restructuring for NUMA compilers
ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Non-unimodular transformations of nested loops
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
The performance realities of massively parallel processors: a case study
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Low copy message passing on the Alliant CAMPUS/800
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
ACM Letters on Programming Languages and Systems (LOPLAS)
Definitions of dependence distance
ACM Letters on Programming Languages and Systems (LOPLAS)
Models of machines and computation for mapping in multicomputers
ACM Computing Surveys (CSUR)
Access normalization: loop restructuring for NUMA computers
ACM Transactions on Computer Systems (TOCS)
Advanced compiler optimizations for sparse computations
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
High performance Fortran: implementor and users workshop
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
A general data dependence test for dynamic, pointer-based data structures
PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
A model for dataflow based vector execution
ICS '94 Proceedings of the 8th international conference on Supercomputing
ICS '94 Proceedings of the 8th international conference on Supercomputing
Reducing data communication overhead for DOACROSS loop nests
ICS '94 Proceedings of the 8th international conference on Supercomputing
ICS '94 Proceedings of the 8th international conference on Supercomputing
Compiler and runtime support for out-of-core HPF programs
ICS '94 Proceedings of the 8th international conference on Supercomputing
The definition of dependence distance
ACM Transactions on Programming Languages and Systems (TOPLAS)
Static analysis of upper and lower bounds on dependences and parallelism
ACM Transactions on Programming Languages and Systems (TOPLAS)
Parallelizing Subroutines in Sequential Programs
IEEE Software
Compilation of out-of-core data parallel programs for distributed memory machines
ACM SIGARCH Computer Architecture News - Special issue on input/output in parallel computer systems
Fusing loops with backward inter loop data dependence
ACM SIGPLAN Notices
Going Beyond Integer Programming with the Omega Test to Eliminate False Data Dependences
IEEE Transactions on Parallel and Distributed Systems
PLDI '95 Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
An array operation synthesis scheme to optimize Fortran 90 programs
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
ACM Computing Surveys (CSUR)
Run-time methods for parallelizing partially parallel loops
ICS '95 Proceedings of the 9th international conference on Supercomputing
Optimal tile size adjustment in compiling general DOACROSS loop nests
ICS '95 Proceedings of the 9th international conference on Supercomputing
Vectorization beyond data dependences
ICS '95 Proceedings of the 9th international conference on Supercomputing
Handling block-cyclic distributed arrays in Vienna Fortran 90
PACT '95 Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques
Towards a structural load testing tool
ISSTA '96 Proceedings of the 1996 ACM SIGSOFT international symposium on Software testing and analysis
IEEE Transactions on Parallel and Distributed Systems
Automatic Data Structure Selection and Transformation for Sparse Matrix Computations
IEEE Transactions on Parallel and Distributed Systems
ACM Computing Surveys (CSUR)
An Implementation Framework for HPF Distributed Arrays on Message-Passing Parallel Computer Systems
IEEE Transactions on Parallel and Distributed Systems
Detection and global optimization of reduction operations for distributed parallel machines
ICS '96 Proceedings of the 10th international conference on Supercomputing
Decomposition Abstraction in Parallel Rule Languages
IEEE Transactions on Parallel and Distributed Systems
Joint Minimization of Code and Data for Synchronous DataflowPrograms
Formal Methods in System Design
Automatic parallelization of object-oriented programming languages using tuple space
CSC '95 Proceedings of the 1995 ACM 23rd annual conference on Computer science
Incorporating application dependent information in an automatic code generating environment
ICS '97 Proceedings of the 11th international conference on Supercomputing
Correctness and parallelism in composite systems
PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Alias analysis of executable code
POPL '98 Proceedings of the 25th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
On the Removal of Anti- and Output-Dependences
International Journal of Parallel Programming
The automatic generation of sparse primitives
ACM Transactions on Mathematical Software (TOMS)
Constraint-based array dependence analysis
ACM Transactions on Programming Languages and Systems (TOPLAS)
IEEE Transactions on Parallel and Distributed Systems
Code motion for explicitly parallel programs
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
An Object-Oriented Framework for Loop Parallelization
The Journal of Supercomputing
Accelerating APL programs with SAC
Proceedings of the conference on APL '99 : On track to the 21st century: On track to the 21st century
Combining structural and procedural programming by parallelizing compilation
SAC '95 Proceedings of the 1995 ACM symposium on Applied computing
APL '98 Proceedings of the APL98 conference on Array processing language
A global communication optimization technique based on data-flow analysis and linear algebra
ACM Transactions on Programming Languages and Systems (TOPLAS)
Adaptive reduction parallelization techniques
Proceedings of the 14th international conference on Supercomputing
An Interleaving Transformation for Parallelizing Reductions for Distributed-Memory Parallel Machines
The Journal of Supercomputing
Scenario-Based Hypersequential Programming
International Journal of Parallel Programming - Special issue on international symposium on high performance computing 1997, part I
A Unified Symbolic Evaluation Framework for Parallelizing Compilers
IEEE Transactions on Parallel and Distributed Systems
Improving Memory Traffic by Assembly-Level Exploitation of Reuses for Vector Registers
The Journal of Supercomputing
Reducing Parallel Program Simulation Complexity by Static Analysis
The Journal of Supercomputing
Loop parallelization algorithms
Compiler optimizations for scalable parallel systems
Compiler optimizations for scalable parallel systems
Communication-free partitioning of nested loops
Compiler optimizations for scalable parallel systems
Automatic partitioning and virtual scheduling for efficient parallel execution
ACM-SE 30 Proceedings of the 30th annual Southeast regional conference
Parallel execution of prolog programs: a survey
ACM Transactions on Programming Languages and Systems (TOPLAS)
Static and Dynamic Locality Optimizations Using Integer Linear Programming
IEEE Transactions on Parallel and Distributed Systems
Automatic Code Mapping on an Intelligent Memory Architecture
IEEE Transactions on Computers
Automatic data and computation decomposition on distributed memory parallel computers
ACM Transactions on Programming Languages and Systems (TOPLAS)
Speculative dynamic vectorization
ISCA '02 Proceedings of the 29th annual international symposium on Computer architecture
Compile Time Barrier Synchronization Minimization
IEEE Transactions on Parallel and Distributed Systems
Automatic intra-register vectorization for the Intel architecture
International Journal of Parallel Programming
Synthesis of Embedded Software from Synchronous Dataflow Specifications
Journal of VLSI Signal Processing Systems
Correctness properties in a shared-memory parallel language
Journal of the ACM (JACM)
Compilation Techniques for Multimedia Processors
International Journal of Parallel Programming
A Vectorizing Compiler for Multimedia Extensions
International Journal of Parallel Programming
Automatic Intra-Register Vectorization for the Intel® Architecture
International Journal of Parallel Programming
Supporting Irregular Distributions Using Data-Parallel Languages
IEEE Parallel & Distributed Technology: Systems & Technology
Hypersequential Programming: A New Way to Develop Concurrent Programs
IEEE Parallel & Distributed Technology: Systems & Technology
Two Program Comprehension Tools for Automatic Parallelization
IEEE Concurrency
Loop-Level Parallelism in Numeric and Symbolic Programs
IEEE Transactions on Parallel and Distributed Systems
Backtracking in Independent And-Parallel Implementations of Logic Programming Languages
IEEE Transactions on Parallel and Distributed Systems
Region Analysis: A Parallel Elimination Method for Data Flow Analysis
IEEE Transactions on Software Engineering
Measuring Design-Level Cohesion
IEEE Transactions on Software Engineering
Parallelizing graph construction operations in programs with cyclic graphs
Parallel Computing
Precise Call Graph Construction for OO Programs in the Presence of Virtual Functions
ICPP '97 Proceedings of the international Conference on Parallel Processing
Sassy: A Language and Optimizing Compiler for Image Processing on Reconfigurable Computing Systems
ICVS '99 Proceedings of the First International Conference on Computer Vision Systems
A Partitioning Programming Environment for a Novel Parallel Architecture
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Semantics and Implementation of a Generalized forall Statement for Parallel Languages
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Derivation of Safety Requirements for Safety Analysis of Object-Oriented Design Documents
COMPSAC '97 Proceedings of the 21st International Computer Software and Applications Conference
Parameter-Induced Aliasing in Ada
Ada Europe '01 Proceedings of the 6th Ade-Europe International Conference Leuven on Reliable Software Technologies
Tiling and Memory Reuse for Sequences of Nested Loops
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
A Generalized Transaction Theory for Database and Non-database Tasks
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Scheduling Iterative Programs onto LogP-Machine
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Balancing Fine- and Medium-Grained Parallelism in Scheduling Loops for the XIMD Architecture
PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Flow-sensitive Interprocedural Analysis Method for Parallelization
PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Sparse Jacobian Computation in Automatic Differentiation by Static Program Analysis
SAS '98 Proceedings of the 5th International Symposium on Static Analysis
Inter-procedural Analysis for Parallelization of Java Programs
ParNum '99 Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia: Parallel Computation
Teraflops Computing: A Challenge to Parallel Numerics?
ParNum '99 Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia: Parallel Computation
Efficient Symbolic Analysis for Optimizing Compilers
CC '01 Proceedings of the 10th International Conference on Compiler Construction
Compiler Transformation of Pointers to Explicit Array Accesses in DSP Applications
CC '01 Proceedings of the 10th International Conference on Compiler Construction
A Technique for FPGA Synthesis Driven by Automatic Source Code Analysis and Transformations
FPL '02 Proceedings of the Reconfigurable Computing Is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications
A Case Study: Effects of WITH-Loop-Folding on the NAS Benchmark MG in SAC
IFL '98 Selected Papers from the 10th International Workshop on 10th International Workshop
Nested Parallel Call Optimization
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
An Efficient Technique of Instruction Scheduling on a Superscalar-Based Mulprocessor
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Cluster computing with message-passing interface
Highly parallel computaions
Supporting procedural constructs in existing SQL compilers
CASCON '96 Proceedings of the 1996 conference of the Centre for Advanced Studies on Collaborative research
Optimized software synthesis for synchronous dataflow
ASAP '97 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures and Processors
Compiler-assisted generation of error-detecting parallel programs
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
A Loop Transformation for Maximizing Parallelism from Single Loops with Nonuniform Dependencies
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
A transformation method to reduce loop overhead in HPF compiler
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
Visualization and sonification of methods
PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
Supercompilers for massively parallel architectures
PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
Hardware/Software Co-Design for Data-Driven Xputer-based Accelerators
VLSID '97 Proceedings of the Tenth International Conference on VLSI Design: VLSI in Multimedia Applications
Identifying parallelism in programs with cyclic graphs
Journal of Parallel and Distributed Computing
On reducing the sensitivity of software reliability to variations in the operational profile
ISSRE '96 Proceedings of the The Seventh International Symposium on Software Reliability Engineering
Design-Level Cohesion Measures: Derivation, Comparison, and Applications
COMPSAC '96 Proceedings of the 20th Conference on Computer Software and Applications
Extracting Parallelism in Nested Loops
COMPSAC '96 Proceedings of the 20th Conference on Computer Software and Applications
Automatic generation of application specific processors
Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems
Parallel Processing of First Order Linear Recurrence on SMP Machines
The Journal of Supercomputing
What can we gain by unfolding loops?
ACM SIGPLAN Notices
Single Assignment C: efficient support for high-level array operations in a functional setting
Journal of Functional Programming
Linear data distribution based on index analysis
High performance scientific and engineering computing
Vectorization for SIMD architectures with alignment constraints
Proceedings of the ACM SIGPLAN 2004 conference on Programming language design and implementation
A unified framework for nonlinear dependence testing and symbolic analysis
Proceedings of the 18th annual international conference on Supercomputing
Compiler based exploration of DSP energy savings by SIMD operations
Proceedings of the 2004 Asia and South Pacific Design Automation Conference
Retargeting Sequential Image-Processing Programs for Data Parallel Execution
IEEE Transactions on Software Engineering
Efficient SIMD Code Generation for Runtime Alignment and Length Conversion
Proceedings of the international symposium on Code generation and optimization
Exploitation of parallelism to nested loops with dependence cycles
Journal of Systems Architecture: the EUROMICRO Journal
Shared memory multiprocessor support for functional array processing in SAC
Journal of Functional Programming
Improving superword level parallelism support in modern compilers
CODES+ISSS '05 Proceedings of the 3rd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
An integrated simdization framework using virtual vectors
Proceedings of the 19th annual international conference on Supercomputing
Optimizing Compiler for the CELL Processor
Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
A quantitative study and estimation models for extensible instructions in embedded processors
Proceedings of the 2004 IEEE/ACM International conference on Computer-aided design
A New Approach to Parallelization of Serial Nested Loops Using Genetic Algorithms
The Journal of Supercomputing
A New Genetic Algorithm for Loop Tiling
The Journal of Supercomputing
An Adaptive Algorithm Selection Framework for Reduction Parallelization
IEEE Transactions on Parallel and Distributed Systems
SAC: a functional array language for efficient multi-threaded execution
International Journal of Parallel Programming
Proceedings of the 44th annual Southeast regional conference
FFT program generation for shared memory: SMP and multicore
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Journal of Parallel and Distributed Computing
Incorporating Intel® MMX$^{\rm TM}$ technology into a Java$^{\rm TM}$ JIT compiler$^{1}$
Scientific Programming
Data Flow Analysis as a General Concept for the Transport of Verifiable Program Annotations
Electronic Notes in Theoretical Computer Science (ENTCS)
Buffer and Register Allocation for Memory Space Optimization
Journal of VLSI Signal Processing Systems
Quantifying ILP by means of graph theory
Proceedings of the 2nd international conference on Performance evaluation methodologies and tools
Program optimization space pruning for a multithreaded gpu
Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization
Foundations for the integration of scheduling techniques into compilers for parallel languages
International Journal of Computational Science and Engineering
From FORTRAN 77 to locality-aware high productivity languages for peta-scale computing
Scientific Programming - Fortran Programming Language and Scientific Programming: 50 Years of Mutual Growth
Software Tools for High-Performance Computiing: Survey and Recommendations
Scientific Programming
Pattern-Driven Automatic Parallelization
Scientific Programming
Scientific Programming
Program optimization carving for GPU computing
Journal of Parallel and Distributed Computing
Exploiting SIMD Parallelism with the CGiS Compiler Framework
Languages and Compilers for Parallel Computing
Parallelizing scientific code with invasive interactive parallelization: a case study with reuseware
Proceedings of the 2008 compFrame/HPC-GECO workshop on Component based high performance
On the exploitation of loop-level parallelism in embedded applications
ACM Transactions on Embedded Computing Systems (TECS)
Modulo scheduling without overlapped lifetimes
Proceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems
OpenDF: a dataflow toolset for reconfigurable hardware and multicore systems
ACM SIGARCH Computer Architecture News
A case study on compiler optimizations for the Intel® Core™ 2 duo processor
International Journal of Parallel Programming
Adaptive Fault Tolerance for Scalable Cluster Computing in Space
International Journal of High Performance Computing Applications
Automatic parallelization for graphics processing units
PPPJ '09 Proceedings of the 7th International Conference on Principles and Practice of Programming in Java
Parallel loop generation and scheduling
The Journal of Supercomputing
Optimal interprocedural program optimization: a new framework and its application
Optimal interprocedural program optimization: a new framework and its application
Loop parallelization in multi-dimensional cartesian space
PSI'06 Proceedings of the 6th international Andrei Ershov memorial conference on Perspectives of systems informatics
Hierarchical automatic differentiation by vertex elimination and source transformation
ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: PartII
Advanced symbolic analysis for compilers: new techniques and algorithms for symbolic program analysis and optimization
Generating SIMD vectorized permutations
CC'08/ETAPS'08 Proceedings of the Joint European Conferences on Theory and Practice of Software 17th international conference on Compiler construction
How to unleash array optimizations on code using recursive data structures
Proceedings of the 24th ACM International Conference on Supercomputing
An OpenCL framework for heterogeneous multicores with local memory
Proceedings of the 19th international conference on Parallel architectures and compilation techniques
Adaptive fault tolerance for many-core based space-borne computing
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
RUBASTEM: a method for testing VHDL behavioral models
HASE'04 Proceedings of the Eighth IEEE international conference on High assurance systems engineering
A performance simulation technique for distributed programs: application to an SOR iterative solver
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
Data dependence analysis for intra-register vectorization
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
Overview of the MPEG Reconfigurable Video Coding Framework
Journal of Signal Processing Systems
Automatic SIMD vectorization of fast fourier transforms for the larrabee and AVX instruction sets
Proceedings of the international conference on Supercomputing
The Journal of Supercomputing
A new carried-dependence self-scheduling algorithm
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and its Applications - Volume Part I
Parallelisation of sequential programs by invasive composition and aspect weaving
APPT'05 Proceedings of the 6th international conference on Advanced Parallel Processing Technologies
Automatic translation of serial to distributed code using CORBA event channels
ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
Towards adaptive migration strategies for mobile agents
WRAC'05 Proceedings of the Second international conference on Radical Agent Concepts: innovative Concepts for Autonomic and Agent-Based Systems
Programming language concepts for multimedia application development
JMLC'06 Proceedings of the 7th joint conference on Modular Programming Languages
Symbolic analysis of imperative programming languages
JMLC'06 Proceedings of the 7th joint conference on Modular Programming Languages
Automatic distribution of sequential code using javasymphony middleware
SOFSEM'06 Proceedings of the 32nd conference on Current Trends in Theory and Practice of Computer Science
Compiler technology for blue gene systems
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Automatically tuned FFTs for bluegene/l's double FPU
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
On dependence analysis for SIMD enhanced processors
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Experiments with auto-parallelizing SPEC2000FP benchmarks
LCPC'04 Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing
Automatic detection of saturation and clipping idioms
LCPC'02 Proceedings of the 15th international conference on Languages and Compilers for Parallel Computing
Parallel reductions: an application of adaptive algorithm selection
LCPC'02 Proceedings of the 15th international conference on Languages and Compilers for Parallel Computing
Automatic scoping of variables in parallel regions of an OpenMP program
WOMPAT'04 Proceedings of the 5th international conference on OpenMP Applications and Tools: shared Memory Parallel Programming with OpenMP
Analysis of pure methods using garbage collection
Proceedings of the 2012 ACM SIGPLAN Workshop on Memory Systems Performance and Correctness
C2FPGA-A dependency-timing graph design methodology
Journal of Parallel and Distributed Computing
Loop Transforming for Reducing Data Alignment on Multi-Core SIMD Processors
Journal of Signal Processing Systems
Hi-index | 0.00 |