A parallel pipelined relational query processor
ACM Transactions on Database Systems (TODS)
On design principles for a molecular computer
Communications of the ACM
Bounds on Algorithm-Based Fault Tolerance in Multiple Processor Systems
IEEE Transactions on Computers - The MIT Press scientific computation series
Structure handling in data-flow systems
IEEE Transactions on Computers - The MIT Press scientific computation series
On input/output speedup in tightly coupled multiprocessors
IEEE Transactions on Computers - The MIT Press scientific computation series
Evaluating two massively parallel machines
Communications of the ACM
Interprocedural dependence analysis and parallelization
SIGPLAN '86 Proceedings of the 1986 SIGPLAN symposium on Compiler construction
IEEE Transactions on Computers
Memory requirements for balanced computer architectures
ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture
Highly concurrent scalar processing
ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture
Optimal pipelining in supercomputers
ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture
The parallel complexity of exponentiating polynomials over finite fields
STOC '85 Proceedings of the seventeenth annual ACM symposium on Theory of computing
The program dependence graph and its use in optimization
ACM Transactions on Programming Languages and Systems (TOPLAS)
ACM SIGARCH Computer Architecture News
Size-time complexity of Boolean networks for prefix computations
STOC '87 Proceedings of the nineteenth annual ACM symposium on Theory of computing
Automatic translation of FORTRAN programs to vector form
ACM Transactions on Programming Languages and Systems (TOPLAS)
Fast temporary storage for serial and parallel execution
ISCA '87 Proceedings of the 14th annual international symposium on Computer architecture
A multiprocessor architecture for two-dimensional digital filters
IEEE Transactions on Computers
Processor Allocation for Horizontal and Vertical Parallelism and Related Speedup Bounds
IEEE Transactions on Computers
Guided self-scheduling: A practical scheduling scheme for parallel supercomputers
IEEE Transactions on Computers
Automatic decomposition of scientific programs for parallel execution
POPL '87 Proceedings of the 14th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
The parallel complexity of exponentiating polynomials over finite fields
Journal of the ACM (JACM)
Compiler Optimizations for Enhancing Parallelism and Their Impact on Architecture Design
IEEE Transactions on Computers - Special issue on architectural support for programming languages and operating systems
A fetch-and-op implementation for parallel computers
ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
The program summary graph and flow-sensitive interprocedual data flow analysis
PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
IEEE Transactions on Computers
A framework for determining useful parallelism
ICS '88 Proceedings of the 2nd international conference on Supercomputing
Parallelization and performance evaluation of circuit simulation on a shared-memory multiprocessor
ICS '88 Proceedings of the 2nd international conference on Supercomputing
Hierarchical registers for scientific computers
ICS '88 Proceedings of the 2nd international conference on Supercomputing
ICS '88 Proceedings of the 2nd international conference on Supercomputing
Impact of self-scheduling order on performance on multiprocessor systems
ICS '88 Proceedings of the 2nd international conference on Supercomputing
Sensitivity analysis of reliability and performability measures for multiprocessor systems
SIGMETRICS '88 Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Size-time complexity of Boolean networks for prefix computations
Journal of the ACM (JACM)
Automatic discovery of parallelism: a tool and an experiment (extended abstract)
PPEALS '88 Proceedings of the ACM/SIGPLAN conference on Parallel programming: experience with applications, languages and systems
Efficient interprocedural analysis for program parallelization and restructuring
PPEALS '88 Proceedings of the ACM/SIGPLAN conference on Parallel programming: experience with applications, languages and systems
On the combination of hardware and software concurrency extraction methods
ACM SIGMICRO Newsletter
Compiling issues for supercomputers
Proceedings of the 1988 ACM/IEEE conference on Supercomputing
Performance-Measurement Tools in a Multiprocessor Environment
IEEE Transactions on Computers
Static analysis of low-level synchronization
PADD '88 Proceedings of the 1988 ACM SIGPLAN and SIGOPS workshop on Parallel and distributed debugging
SPAA '89 Proceedings of the first annual ACM symposium on Parallel algorithms and architectures
On the adequacy of program dependence graphs for representing programs
POPL '88 Proceedings of the 15th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Automatic generation of DAG parallelism
PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Partial order programming (extended abstract)
POPL '89 Proceedings of the 16th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
“Combining” as a compilation technique for VLIW architectures
MICRO 22 Proceedings of the 22nd annual workshop on Microprogramming and microarchitecture
Minimum Distance: A Method for Partitioning Recurrences for Multiprocessors
IEEE Transactions on Computers
The parascope editor: an interactive parallel programming tool
Proceedings of the 1989 ACM/IEEE conference on Supercomputing
Utilizing Multidimensional Loop Parallelism on Large Scale Parallel Processor Systems
IEEE Transactions on Computers
Region Scheduling: An Approach for Detecting and Redistributing Parallelism
IEEE Transactions on Software Engineering
IEEE Transactions on Computers
Area-Time Optimal Adder Design
IEEE Transactions on Computers
Data Dependency Graphs for Ada Programs
IEEE Transactions on Software Engineering
Improving register allocation for subscripted variables
PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Compact representations for control dependence
PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
A foundation for sequentializing parallel code
SPAA '90 Proceedings of the second annual ACM symposium on Parallel algorithms and architectures
Automatic parallelization of APL-style programs
APL '90 Conference proceedings on APL 90: for the future
Analysis of event synchronization in a parallel programming tool
PPOPP '90 Proceedings of the second ACM SIGPLAN symposium on Principles & practice of parallel programming
Dependence flow graphs: an algebraic approach to program dependencies
POPL '91 Proceedings of the 18th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Compiler algorithms for event variable synchronization
ICS '91 Proceedings of the 5th international conference on Supercomputing
The hierarchical task graph and its use in auto-scheduling
ICS '91 Proceedings of the 5th international conference on Supercomputing
Analysis and transformation in the ParaScope editor
ICS '91 Proceedings of the 5th international conference on Supercomputing
Optimal schedules for parallel prefix computation with bounded resources
PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
Event-based performance perturbation: a case study
PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
A real introduction to supercomputing: a user training course
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
A compiler-assisted approach to SPMD execution
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
PLDI '91 Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation
An Improved Vector-Reduction Method
IEEE Transactions on Computers
Efficiently computing static single assignment form and the control dependence graph
ACM Transactions on Programming Languages and Systems (TOPLAS)
Techniques for debugging parallel programs with flowback analysis
ACM Transactions on Programming Languages and Systems (TOPLAS)
ACM SIGARCH Computer Architecture News - Symposium on parallel algorithms and architectures
A new technique for induction variable removal
MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
Interprocedural transformations for parallel code generation
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Incremental tree height reduction for high level synthesis
DAC '91 Proceedings of the 28th ACM/IEEE Design Automation Conference
Unexpected side effects of inline substitution: a case study
ACM Letters on Programming Languages and Systems (LOPLAS)
PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Automatic software cache coherence through vectorization
ICS '92 Proceedings of the 6th international conference on Supercomputing
Speedup of band linear recurrences in the presence of resource constraints
ICS '92 Proceedings of the 6th international conference on Supercomputing
Microarchitecture support for dynamic scheduling of acyclic task graphs
MICRO 25 Proceedings of the 25th annual international symposium on Microarchitecture
IEEE Transactions on Computers
Compiler blockability of numerical algorithms
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Compiler code transformations for superscalar-based high performance systems
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Using static single assignment form in a code optimizer
ACM Letters on Programming Languages and Systems (LOPLAS)
Parallelization and analysis of a linear adaptive filtering algorithm
SAC '93 Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: states of the art and practice
Complex Gaussian integers for “Gaussian graphics”
ACM SIGPLAN Notices
Advanced compiler optimizations for sparse computations
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
ICS '94 Proceedings of the 8th international conference on Supercomputing
Height reduction of control recurrences for ILP processors
MICRO 27 Proceedings of the 27th annual international symposium on Microarchitecture
Minimization of memory traffic in high-level synthesis
DAC '94 Proceedings of the 31st annual Design Automation Conference
Sequencer-based data path synthesis of regular iterative algorithms
DAC '94 Proceedings of the 31st annual Design Automation Conference
Improving the ratio of memory operations to floating-point operations in loops
ACM Transactions on Programming Languages and Systems (TOPLAS)
Compiler transformations for high-performance computing
ACM Computing Surveys (CSUR)
Parallelizing Iterative Loops with Conditional Branching
IEEE Transactions on Parallel and Distributed Systems
Symbolic array dataflow analysis for array privatization and program parallelization
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Vectorization beyond data dependences
ICS '95 Proceedings of the 9th international conference on Supercomputing
Critical path reduction for scalar programs
Proceedings of the 28th annual international symposium on Microarchitecture
Computing Programs Containing Band Linear Recurrences on Vector Supercomputers
IEEE Transactions on Parallel and Distributed Systems
Instruction scheduling for the HP PA-8000
Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
The Strict Time Lower Bound and Optimal Schedules for Parallel Prefix with Resource Constraints
IEEE Transactions on Computers
Optimization on instruction reorganization
MICRO 23 Proceedings of the 23rd annual workshop and symposium on Microprogramming and microarchitecture
On the combination of hardware and software concurrency extraction methods
MICRO 20 Proceedings of the 20th annual workshop on Microprogramming
Compiler blockability of dense matrix factorizations
ACM Transactions on Mathematical Software (TOMS)
ICS '98 Proceedings of the 12th international conference on Supercomputing
An Efficient Solution to the Cache Thrashing Problem Caused by True Data Sharing
IEEE Transactions on Computers
The program decision logic approach to predicated execution
ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
Control CPR: a branch height reduction optimization for EPIC architectures
Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
New tiling techniques to improve cache temporal locality
Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
The composite binary cube — a family of interconnection networks for multiprocessors
ICS '89 Proceedings of the 3rd international conference on Supercomputing
Constraint based vectorization
ICS '89 Proceedings of the 3rd international conference on Supercomputing
Performance analysis of buffered banyan networks under nonuniform traffic
ICS '89 Proceedings of the 3rd international conference on Supercomputing
Supercomputers and distributed computing
CSC '85 Proceedings of the 1985 ACM thirteenth annual conference on Computer Science
The Area-Time Complexity of Binary Multiplication
Journal of the ACM (JACM)
A new class of parallel algorithms for solving linear tridiagonal systems
ACM '86 Proceedings of 1986 ACM Fall joint computer conference
Performance of three dataflow computers
CSC '86 Proceedings of the 1986 ACM fourteenth annual conference on Computer science
A technique for reducing synchronization overhead in large scale multiprocessors
ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
A Space-Time Representation Method of Iterative Algorithms for the Design of Processor Arrays
Journal of VLSI Signal Processing Systems
Efficient Interprocedural Array Data-Flow Analysis for Automatic Program Parallelization
IEEE Transactions on Software Engineering - Special issue on architecture-independent languages and software tools for parallel processing
Undecidability of context-sensitive data-dependence analysis
ACM Transactions on Programming Languages and Systems (TOPLAS)
Generation of Efficient Nested Loops from Polyhedra
International Journal of Parallel Programming - Special issue on instruction-level parallelism and parallelizing compilation, part 2
Classification Categories and Historical Development of Circuit Switching Topologies
ACM Computing Surveys (CSUR)
Data locality enhancement by memory reduction
ICS '01 Proceedings of the 15th international conference on Supercomputing
Compiler optimizations for scalable parallel systems
A program form based on data dependency in predicate regions
POPL '83 Proceedings of the 10th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
Dependence graphs and compiler optimizations
POPL '81 Proceedings of the 8th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Automatic intra-register vectorization for the Intel architecture
International Journal of Parallel Programming
Symbolic algebra and timing driven data-flow synthesis
Proceedings of the 2001 IEEE/ACM international conference on Computer-aided design
Automatic Intra-Register Vectorization for the Intel® Architecture
International Journal of Parallel Programming
Multiprocessor Synchronization for Concurrent Loops
IEEE Software
A Reduced-Area Scheme for Carry-Select Adders
IEEE Transactions on Computers
An Empirical Study of Fortran Programs for Parallelizing Compilers
IEEE Transactions on Parallel and Distributed Systems
Interactive Parallel Programming using the ParaScope Editor
IEEE Transactions on Parallel and Distributed Systems
Design and Analysis of a Scalable Cache Coherence Scheme Based on Clocks and Timestamps
IEEE Transactions on Parallel and Distributed Systems
Automatic Extraction of Functional Parallelism from Ordinary Programs
IEEE Transactions on Parallel and Distributed Systems
Partitioning and Labeling of Loops by Unimodular Transformations
IEEE Transactions on Parallel and Distributed Systems
Access Graphs: A Model for Investigating Memory Consistency
IEEE Transactions on Parallel and Distributed Systems
Balancing Fine- and Medium-Grained Parallelism in Scheduling Loops for the XIMD Architecture
PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
A hierarchical basis for reordering transformations
POPL '84 Proceedings of the 11th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
The mimola design system: Tools for the design of digital processors
DAC '84 Proceedings of the 21st Design Automation Conference
The Boolean Vector Machine [BVM]
ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture
Online pipeline systems for recursive numeric computations
ISCA '80 Proceedings of the 7th annual symposium on Computer Architecture
A retargetable microcode generation system for a high-level microprogramming language
MICRO 14 Proceedings of the 14th annual workshop on Microprogramming
Heuristics for the global optimization of microprograms
MICRO 13 Proceedings of the 13th annual workshop on Microprogramming
A pipelined processing unit for on-line division
ISCA '78 Proceedings of the 5th annual symposium on Computer architecture
Automatic design with dependence graphs
DAC '80 Proceedings of the 17th Design Automation Conference
The chip complexity of binary arithmetic
STOC '80 Proceedings of the twelfth annual ACM symposium on Theory of computing
Data broadcasting in linearly scheduled array processors
ISCA '84 Proceedings of the 11th annual international symposium on Computer architecture
Fault-secure algorithms for multiple-processor systems
ISCA '84 Proceedings of the 11th annual international symposium on Computer architecture
Simulation of aerodynamics problem on a distributed shared-memory machine
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
Extracting Parallelism in Nested Loops
COMPSAC '96 Proceedings of the 20th Conference on Computer Software and Applications
A design representation for high level synthesis
EURO-DAC '90 Proceedings of the conference on European design automation
An experimental evaluation of scalar replacement on scientific benchmarks
Software—Practice & Experience
Interprocedural dependence analysis and parallelization
ACM SIGPLAN Notices - Best of PLDI 1979-1999
Improving register allocation for subscripted variables
ACM SIGPLAN Notices - Best of PLDI 1979-1999
A unified framework for nonlinear dependence testing and symbolic analysis
Proceedings of the 18th annual international conference on Supercomputing
Code Generation in the Polyhedral Model Is Easier Than You Think
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques
Encyclopedia of Computer Science
A polynomial-time algorithm for memory space reduction
International Journal of Parallel Programming
Challenges in exploitation of loop parallelism in embedded applications
CODES+ISSS '06 Proceedings of the 4th international conference on Hardware/software codesign and system synthesis
Lightweight lock-free synchronization methods for multithreading
Proceedings of the 20th annual international conference on Supercomputing
The Measurement of Performance on a Highly Parallel System
IEEE Transactions on Computers
A Regular Layout for Parallel Adders
IEEE Transactions on Computers
A Basis for the Quantitative Comparison of Computer Number Systems
IEEE Transactions on Computers
The Burroughs Scientific Processor (BSP)
IEEE Transactions on Computers
Supersystems: Technology and Architecture
IEEE Transactions on Computers
On the Performance Enhancement of Paging Systems Through Program Analysis and Transformations
IEEE Transactions on Computers
Divide-and-Conquer for Parallel Processing
IEEE Transactions on Computers
High-Speed Multiprocessors and Compilation Techniques
IEEE Transactions on Computers
The Expression Processor: A Pipelined, Multiple- Processor Architecture
IEEE Transactions on Computers
Characterizing Computers and Optimizing the FACR(l) Poisson-Solver on Parallel Unicomputers
IEEE Transactions on Computers
Comparing Serial Computers, Arrays, and Networks Using Measures of "Active Resources"
IEEE Transactions on Computers
Minimization of Interprocessor Communication for Parallel Computation
IEEE Transactions on Computers
Advanced collective communication in aspen
Proceedings of the 22nd annual international conference on Supercomputing
On the exploitation of loop-level parallelism in embedded applications
ACM Transactions on Embedded Computing Systems (TECS)
Specifying target resources in a machine independent higher level language
AFIPS '81 Proceedings of the May 4-7, 1981, national computer conference
Image understanding architectures
AFIPS '80 Proceedings of the May 19-22, 1980, national computer conference
Techniques for efficient placement of synchronization primitives
Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
Timing-driven N-way decomposition
Proceedings of the 19th ACM Great Lakes symposium on VLSI
Synchronization optimizations for efficient execution on multi-cores
Proceedings of the 23rd international conference on Supercomputing
On approximating the ideal random access machine by physical machines
Journal of the ACM (JACM)
An area-time efficient NMOS adder
Integration, the VLSI Journal
The effect of multiprocessor radius on scaling
Parallel Computing
On control signals for multi-dimensional time
LCPC'06 Proceedings of the 19th international conference on Languages and compilers for parallel computing
Improving data locality by chunking
CC'03 Proceedings of the 12th international conference on Compiler construction
Continuous signature monitoring: efficient concurrent-detection of processor control errors
ITC'88 Proceedings of the 1988 international conference on Test: new frontiers in testing
Efficient code generation for automatic parallelization and optimization
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
How many threads to spawn during program multithreading?
LCPC'10 Proceedings of the 23rd international conference on Languages and compilers for parallel computing
An improved DNA-sticker addition algorithm and its application to logarithmic arithmetic
DNA'11 Proceedings of the 17th international conference on DNA computing and molecular programming
Identifying hotspots in a program for data parallel architecture: an early experience
Proceedings of the 5th India Software Engineering Conference
Automatic detection of saturation and clipping idioms
LCPC'02 Proceedings of the 15th international conference on Languages and Compilers for Parallel Computing
Polyhedral code generation in the real world
CC'06 Proceedings of the 15th international conference on Compiler Construction
Hi-index | 0.11 |