Ultracomputers

Authors:
Jacob T. Schwartz
Affiliations:
Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY
Venue:
ACM Transactions on Programming Languages and Systems (TOPLAS)
Year:
1980

Citing 62
Cited 88

Algorithms for Parallel-Search Memories

Journal of the ACM (JACM)
On the Time Required to Perform Addition

Journal of the ACM (JACM)
Large Parallel Computers

Journal of the ACM (JACM)
The Organization of Computations for Uniform Recurrence Equations

Journal of the ACM (JACM)
Application of Parallel Processing to Numerical Weather Prediction

Journal of the ACM (JACM)
Matrix Inversion Using Parallel Processing

Journal of the ACM (JACM)
On the Time Required to Perform Multiplication

Journal of the ACM (JACM)
A Permutation Network

Journal of the ACM (JACM)
Structuring of Parallel Algorithms

Journal of the ACM (JACM)
An Adaptation of the Fast Fourier Transform for Parallel Processing

Journal of the ACM (JACM)
Scheduling Parallel Computations

Journal of the ACM (JACM)
An Efficient Parallel Algorithm for the Solution of a Tridiagonal Linear System of Equations

Journal of the ACM (JACM)
The Parallel Evaluation of General Arithmetic Expressions

Journal of the ACM (JACM)
Bounds to Complexities of Networks for Sorting and for Switching

Journal of the ACM (JACM)
New Algorithms and Lower Bounds for the Parallel Evaluation of Certain Rational Expressions and Recurrences

Journal of the ACM (JACM)
The Complexity of Parallel Evaluation of Linear Recurrences

Journal of the ACM (JACM)
On Stable Parallel Linear System Solvers

Journal of the ACM (JACM)
Parallel permutation and sorting algorithms and a new generalized connection network

Journal of the ACM (JACM)
Parallel Tridiagonal Equation Solvers

ACM Transactions on Mathematical Software (TOMS)
The Solution of Tridiagonal Linear Systems on the CDC STAR 100 Computer

ACM Transactions on Mathematical Software (TOMS)
Computer Interconnection Structures: Taxonomy, Characteristics, and Examples

ACM Computing Surveys (CSUR)
Fast parallel sorting algorithms

Communications of the ACM
Glypnir—a programming language for Illiac IV

Communications of the ACM
The parallel execution of DO loops

Communications of the ACM
Parallelism in tape-sorting

Communications of the ACM
Merging with parallel processors

Communications of the ACM
An efficient data structure for dynamic memory management

Journal of Systems and Software
On the time required for a sequence of matrix products

Communications of the ACM
Parallel numerical methods for the solution of equations

Communications of the ACM
A case study in programming for parallel-processors

Communications of the ACM
Parallel methods for integrating ordinary differential equations

Communications of the ACM
A preliminary architecture for a basic data-flow processor

ISCA '75 Proceedings of the 2nd annual symposium on Computer architecture
Looking for an approach to a theory of models for parallel computation

Proceedings of the International Sympoisum on Theoretical Programming
Towards automtical construction of parallel programs

Proceedings of the International Sympoisum on Theoretical Programming
Study of multistage SIMD interconnection networks

ISCA '78 Proceedings of the 5th annual symposium on Computer architecture
A hierarchical, restructurable multi-microprocessor architecture

ISCA '76 Proceedings of the 3rd annual symposium on Computer architecture
Efficient message routing in Mega-Micro-Computer networks

ISCA '76 Proceedings of the 3rd annual symposium on Computer architecture
Parallel algorithms for the transitive closure and the connected component problems

STOC '76 Proceedings of the eighth annual ACM symposium on Theory of computing
Sorting on a mesh-connected parallel computer

STOC '76 Proceedings of the eighth annual ACM symposium on Theory of computing
The complexity of parallel evaluation of linear recurrence

STOC '75 Proceedings of seventh annual ACM symposium on Theory of computing
Interconnecting a distributed processor system for avionics

ISCA '73 Proceedings of the 1st annual symposium on Computer architecture
Banyan networks for partitioning multiprocessor systems

ISCA '73 Proceedings of the 1st annual symposium on Computer architecture
System balance analysis for vector computers

ACM '75 Proceedings of the 1975 annual conference
The universality of various types of SIMD machine interconnection networks

ISCA '77 Proceedings of the 4th annual symposium on Computer architecture
A large scale, homogeneous, fully distributed parallel machine, I

ISCA '77 Proceedings of the 4th annual symposium on Computer architecture
A Large Scale, Homogenous, Fully Distributed Parallel Machine, II

ISCA '77 Proceedings of the 4th annual symposium on Computer architecture
A GRAPH MODEL FOR PARALLEL COMPUTATIONS

A GRAPH MODEL FOR PARALLEL COMPUTATIONS
AN ANALYSIS OF SORTING NETWORKS

AN ANALYSIS OF SORTING NETWORKS
A survey of models for parallel computing

A survey of models for parallel computing
A generalization of the divide-sort-merge strategy for sorting networks

A generalization of the divide-sort-merge strategy for sorting networks
A lower bound for sorting networks that use the divide-sort-merge strategy

A lower bound for sorting networks that use the divide-sort-merge strategy
Large [g,d] sorting networks

Large [g,d] sorting networks
Parallel solution methods for triangular linear systems of equations

Parallel solution methods for triangular linear systems of equations
Parallelism exposure and exploitation in programs

Parallelism exposure and exploitation in programs
Memory-processor connection networks

Memory-processor connection networks
Speedup of iterative programs in multiprocessing systems.

Speedup of iterative programs in multiprocessing systems.
Interprocessor connections--capabilities, exploitation and effectiveness.

Interprocessor connections--capabilities, exploitation and effectiveness.
Parallel graph processing using depth-first search and breadth-first search.

Parallel graph processing using depth-first search and breadth-first search.
A study of parallelism in graph theory.

A study of parallelism in graph theory.
A universal computer capable of executing an arbitrary number of sub-programs simultaneously

IRE-AIEE-ACM '59 (Eastern) Papers presented at the December 1-3, 1959, eastern joint IRE-AIEE-ACM computer conference
Intercommunication of processors and memory

AFIPS '67 (Fall) Proceedings of the November 14-16, 1967, fall joint computer conference
Interconnection networks: a survey and assessment

AFIPS '74 Proceedings of the May 6-10, 1974, national computer conference and exposition

Efficient Schemes for Parallel Communication

Journal of the ACM (JACM)
The cosmic cube

Communications of the ACM - Special section on computer architecture
Virtual time

ACM Transactions on Programming Languages and Systems (TOPLAS)
MULTILISP: a language for concurrent symbolic computation

ACM Transactions on Programming Languages and Systems (TOPLAS)
Data parallel algorithms

Communications of the ACM - Special issue on parallelism
How to share memory in a distributed system

Journal of the ACM (JACM)
Parallel speedup of sequential machines: a defense of parallel computation thesis

ACM SIGACT News
Parallel hashing—an efficient implementation of shared memory

STOC '86 Proceedings of the eighteenth annual ACM symposium on Theory of computing
Reduced distance routing in single-state shuffle-exchange interconnection networks

SIGMETRICS '87 Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems
New Connectivity and MSF Algorithms for Shuffle-Exchange Network and PRAM

IEEE Transactions on Computers
Parallelization and Performance Analysis of the Cooley-Tukey FFT Algorithm for Shared-Memory Architectures

IEEE Transactions on Computers
Applications considerations in the system design of highly concurrent multiprocessors

IEEE Transactions on Computers
Matrix operations on a multicomputer system with switchable main memory modules and dynamic control

IEEE Transactions on Computers
Semantic parallelization: a practical exercise in abstract interpretation

POPL '87 Proceedings of the 14th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
Language and visualization support for large-scale concurrency

ICSE '88 Proceedings of the 10th international conference on Software engineering
Experiences with poker

PPEALS '88 Proceedings of the ACM/SIGPLAN conference on Parallel programming: experience with applications, languages and systems
Topologies' - computational messaging for multicomputers

C3P Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1
The Design, Analysis and Simulation of a Fault-Tolerant Interconnection Network Supporting the Fetch-and-Add Primitive

IEEE Transactions on Computers
2-D SIMD algorithms in the perfect shuffle networks

ISCA '89 Proceedings of the 16th annual international symposium on Computer architecture
Scans as Primitive Parallel Operations

IEEE Transactions on Computers
The periodic balanced sorting network

Journal of the ACM (JACM)
Parallel Parsing of Arithmetic Expressions

IEEE Transactions on Computers
“Topologies”—distributed objects on multicomputers

ACM Transactions on Computer Systems (TOCS)
A bridging model for parallel computation

Communications of the ACM
A Parallel Algorithm for Solving Sparse Triangular Systems

IEEE Transactions on Computers
Fast, Deterministic Routing, on Hypercubes, Using Small Buffers

IEEE Transactions on Computers
Making asynchronous parallelism safe for the world

POPL '90 Proceedings of the 17th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Deterministic sorting in nearly logarithmic time on the hypercube and related computers

STOC '90 Proceedings of the twenty-second annual ACM symposium on Theory of computing
Process coordination with fetch-and-increment

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
Efficient parallel algorithms on restartable fail-stop processors

PODC '91 Proceedings of the tenth annual ACM symposium on Principles of distributed computing
Optimal routing of parentheses on the hypercube

SPAA '92 Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures
A VLSI decomposition of the deBruijn graph

Journal of the ACM (JACM)
The CM-5 Connection Machine: a scalable supercomputer

Communications of the ACM
A survey of PRAM simulation techniques

ACM Computing Surveys (CSUR)
Parallel solutions to geometric problems in the scan model of computation

Journal of Computer and System Sciences
Are multiport memories physically feasible?

ACM SIGARCH Computer Architecture News - Special issue on input/output in parallel computer systems
Are multiport memories physically feasible?

ACM SIGARCH Computer Architecture News
An efficient and fast parallel-connected component algorithm

Journal of the ACM (JACM)
Strategic directions in computer architecture

ACM Computing Surveys (CSUR) - Special ACM 50th-anniversary issue: strategic directions in computing research
The NYU ultracomputer—designing a MIMD, shared-memory parallel machine

25 years of the international symposia on Computer architecture (selected papers)
Connection Machine Lisp: fine-grained parallel symbolic processing

LFP '86 Proceedings of the 1986 ACM conference on LISP and functional programming
An Efficient General-Purpose Parallel Computer

Journal of the ACM (JACM)
Complexity Results for Permuting Data and Other Computations on Parallel Processors

Journal of the ACM (JACM)
Using Emulations to Enhance the Performance of Parallel Architectures

IEEE Transactions on Parallel and Distributed Systems
Highly parallel Ada—Ada on an ultracomputer

SIGAda '85 Proceedings of the 1985 annual ACM SIGAda international conference on Ada
An empirical evaluation of two memory-efficient directory methods

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Issues related to MIMD shared-memory computers: the NYU ultracomputer approach

ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
A decentralized control, highly concurrent multiprocesssor

ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
Parallel hashing: an efficient implementation of shared memory

Journal of the ACM (JACM)
A Quantitative Evaluation of the Feasibility of, and Suitable Hardware Architectures for, an Adaptive, Parallel Finite-Element System

ACM Transactions on Mathematical Software (TOMS)
Basic Techniques for the Efficient Coordination of Very Large Numbers of Cooperating Sequential Processors

ACM Transactions on Programming Languages and Systems (TOPLAS)
An overview of computational complexity

Communications of the ACM
Embedding graphs with bounded treewidth into their optimal hypercubes

Journal of Algorithms
Efficient dynamic embeddings of binary trees into hypercubes

Journal of Algorithms
An Optimal Implementation of Broadcasting with Selective Reduction

IEEE Transactions on Parallel and Distributed Systems
Program Structuring for Effective Parallel Portability

IEEE Transactions on Parallel and Distributed Systems
Controlling memory access concurrency in efficient fault-tolerant parallel algorithms

Nordic Journal of Computing
Emulating an MIMD architecture

MICRO 15 Proceedings of the 15th annual workshop on Microprogramming
A parallel scan conversion algorithm with anti-aliasing for a general-purpose ultracomputer

SIGGRAPH '83 Proceedings of the 10th annual conference on Computer graphics and interactive techniques
The Boolean Vector Machine [BVM]

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture
The NYU Ultracomputer—designing a MIMD, shared-memory parallel machine (Extended Abstract)

ISCA '82 Proceedings of the 9th annual symposium on Computer Architecture
Implementation of multilisp: Lisp on a multiprocessor

LFP '84 Proceedings of the 1984 ACM Symposium on LISP and functional programming
Routing, merging and sorting on parallel models of computation

STOC '82 Proceedings of the fourteenth annual ACM symposium on Theory of computing
An efficient general purpose parallel computer

STOC '81 Proceedings of the thirteenth annual ACM symposium on Theory of computing
Universal schemes for parallel communication

STOC '81 Proceedings of the thirteenth annual ACM symposium on Theory of computing
Efficient schemes for parallel communication

PODC '82 Proceedings of the first ACM SIGACT-SIGOPS symposium on Principles of distributed computing
On parallel searching (Extended Abstract)

PODC '82 Proceedings of the first ACM SIGACT-SIGOPS symposium on Principles of distributed computing
The balanced sorting network

PODC '83 Proceedings of the second annual ACM symposium on Principles of distributed computing
Relations between concurrent-write models of parallel computation

PODC '84 Proceedings of the third annual ACM symposium on Principles of distributed computing
A probabilistic relation between desirable and feasible, models of parallel computation

STOC '84 Proceedings of the sixteenth annual ACM symposium on Theory of computing
The random access hierarchy

STOC '83 Proceedings of the fifteenth annual ACM symposium on Theory of computing
Nial: A candidate language for fifth generation computer systems

ACM '84 Proceedings of the 1984 annual conference of the ACM on The fifth generation challenge
Emulating Direct Products by Index-Shuffle Graphs

IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Data-parallel polygonization

Parallel Computing - Special issue: High performance computing with geographical data
The design and development of ZPL

Proceedings of the third ACM SIGPLAN conference on History of programming languages
Scan primitives for GPU computing

Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware
An overview of computational complexity

ACM Turing award lectures
The NYU Ultracomputer Designing an MIMD Shared Memory Parallel Computer

IEEE Transactions on Computers
Quotient Networks

IEEE Transactions on Computers
Efficient VLSI Networks for Parallel Processing Based on Orthogonal Trees

IEEE Transactions on Computers
Mesh-Connected Computers with Broadcasting

IEEE Transactions on Computers
Optimality of a Two-Phase Strategy for Routing in Interconnection Networks

IEEE Transactions on Computers
A Layout for the Shuffle-Exchange Network with O(N2/log3/2N) Area

IEEE Transactions on Computers
Fast scan algorithms on graphics processors

Proceedings of the 22nd annual international conference on Supercomputing
Configurable emulated shared memory architecture for general purpose MP-SOCs and NOC regions

NOCS '09 Proceedings of the 2009 3rd ACM/IEEE International Symposium on Networks-on-Chip
Paper: Efficient logarithmic time parallel algorithms for the cholesky decomposition and gram-schmidt process

Parallel Computing
Space-Efficient Preprocessing Schemes for Range Minimum Queries on Static Arrays

SIAM Journal on Computing
Randomized PRAM Simulation

Fundamenta Informaticae

Quantified Score

Hi-index	0.09

Visualization

Abstract

A class of parallel processors potentially involving thousands of individual processing elements is described. The architecture is based on the perfect shuffle connection and has two favorable characteristics: (1) Each processor communicates with a fixed number of other processors. (2) Important communication functions can be accomplished in time proportional to the logarithm of the number of processors. A number of basic algorithms for these “ultracomputers” are presented, and physical design considerations are discussed in a preliminary fashion.