The art of computer programming, volume 2 (3rd ed.): seminumerical algorithms
The art of computer programming, volume 2 (3rd ed.): seminumerical algorithms
LFP '86 Proceedings of the 1986 ACM conference on LISP and functional programming
Connection Machine Lisp: fine-grained parallel symbolic processing
LFP '86 Proceedings of the 1986 ACM conference on LISP and functional programming
ACM Transactions on Programming Languages and Systems (TOPLAS)
Communications of the ACM
Development of an APL standard
APL '79 Proceedings of the international conference on APL: part 2
A programming language
Update to "data parallel algorithms"
Communications of the ACM
Communications of the ACM - Special issue on parallelism
Hypernet: A communication-efficient architecture for constructing massively parallel computers
IEEE Transactions on Computers
IMPULSE: a high performance processing unit for multiprocessors for scientific calculation
ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
Pipelined data parallel algorithms—concept and modeling
ICS '88 Proceedings of the 2nd international conference on Supercomputing
Time space tradeoffs in vector algorithms for APL functions
ACM SIGPLAN Notices
Compiling Fortran 8x array features for the connection machine computer system
PPEALS '88 Proceedings of the ACM/SIGPLAN conference on Parallel programming: experience with applications, languages and systems
Compiling C* programs for a hypercube multicomputer
PPEALS '88 Proceedings of the ACM/SIGPLAN conference on Parallel programming: experience with applications, languages and systems
What have we learnt from using real parallel machines to solve real problems?
C3P Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2
Large-grain pipelining on hypercube multiprocessors
C3P Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2
High-speed implementations of rule-based systems
ACM Transactions on Computer Systems (TOCS)
Algorithmic Techniques for Computer Vision on a Fine-Grained Parallel Machine
IEEE Transactions on Pattern Analysis and Machine Intelligence
How to write parallel programs: a guide to the perplexed
ACM Computing Surveys (CSUR)
SPAA '89 Proceedings of the first annual ACM symposium on Parallel algorithms and architectures
Massively parallel switch-level simulation: a feasibility study
DAC '89 Proceedings of the 26th ACM/IEEE Design Automation Conference
Data parallel simulation using time-warp on the connection machine
DAC '89 Proceedings of the 26th ACM/IEEE Design Automation Conference
Parallel compilation for a parallel machine
PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Logic simulation on massively parallel architectures
ISCA '89 Proceedings of the 16th annual international symposium on Computer architecture
Implementation of a hypersonic rarefied flow particle simulation on the connection machine
Proceedings of the 1989 ACM/IEEE conference on Supercomputing
Practical parallel supercomputing: examples from chemistry and physics
Proceedings of the 1989 ACM/IEEE conference on Supercomputing
Algorithms for Image Component Labeling on SIMD Mesh-Connected Computers
IEEE Transactions on Computers
Parallel Quicksort Using Fetch-And-Add
IEEE Transactions on Computers
A functional programming language compiler for massively parallel computers
LFP '90 Proceedings of the 1990 ACM conference on LISP and functional programming
Making asynchronous parallelism safe for the world
POPL '90 Proceedings of the 17th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Mixed Programming Metaphors in a Shared Dataspace Model of Concurrency
IEEE Transactions on Software Engineering
Parallel programming with coordination structures
POPL '91 Proceedings of the 18th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Scan primitives for vector computers
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Compiling APL for parallel and vector execution
APL '91 Proceedings of the international conference on APL '91
High Performance Software Testing on SIMD Machines
IEEE Transactions on Software Engineering
(SM)/sup 2/-II: A Large-Scale Multiprocessor for Sparse Matrix Calculations
IEEE Transactions on Computers
DAC '90 Proceedings of the 27th ACM/IEEE Design Automation Conference
Analysis of replicated data algorithms on processor array architectures
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
OOPSLA/ECOOP '90 Proceedings of the workshop on Object-based concurrent programming
Parallel expression in the APL2 language
IBM Systems Journal
The network architecture of the Connection Machine CM-5 (extended abstract)
SPAA '92 Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures
Fast Addition of Large Integers
IEEE Transactions on Computers
Exploiting task and data parallelism on a multicomputer
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
An equational language for data-parallelism
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Compiling task and data parallel programs for iWarp
ACM SIGPLAN Notices - Workshop on languages, compilers and run-time environments for distributed memory multiprocessors
The CM-5 Connection Machine: a scalable supercomputer
Communications of the ACM
On the relation between functional and data parallel programming languages
FPCA '93 Proceedings of the conference on Functional programming languages and computer architecture
Autonomous SIMD flexibility in the MP-1 and MP-2
SPAA '93 Proceedings of the fifth annual ACM symposium on Parallel algorithms and architectures
Vector prefix addition on sub-bus mesh computers
SPAA '93 Proceedings of the fifth annual ACM symposium on Parallel algorithms and architectures
Approximate time-parallel simulation of queueing systems with losses
WSC '92 Proceedings of the 24th conference on Winter simulation
Data parallel programming: a transition from serial to parallel computing
SIGCSE '93 Proceedings of the twenty-fourth SIGCSE technical symposium on Computer science education
A data-parallel algorithm for three-dimensional Delaunay triangulation and its implementation
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Parallelizing complex scans and reductions
PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Efficient compilation of high-level data parallel algorithms
SPAA '94 Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures
SPAA '94 Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures
Fast phylogenetic analysis on a massively parallel machine
ICS '94 Proceedings of the 8th international conference on Supercomputing
A bibliography on parallel parsing
ACM SIGPLAN Notices
Request Combining in Multiprocessors with Arbitrary Interconnection Networks
IEEE Transactions on Parallel and Distributed Systems
Trade-offs between communication throughput and parallel time
STOC '94 Proceedings of the twenty-sixth annual ACM symposium on Theory of computing
Cilk: an efficient multithreaded runtime system
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Fundamental ideas for a parallel computing course
ACM Computing Surveys (CSUR)
ACM SIGPLAN Lisp Pointers
Programming parallel algorithms
Communications of the ACM
Detection and global optimization of reduction operations for distributed parallel machines
ICS '96 Proceedings of the 10th international conference on Supercomputing
Decomposition Abstraction in Parallel Rule Languages
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Toward a More Realistic Performance Evaluation of Interconnection Networks
IEEE Transactions on Parallel and Distributed Systems
Performance modeling of the IDES framework
Proceedings of the eleventh workshop on Parallel and distributed simulation
Mask verification on the connection machine
DAC '88 Proceedings of the 25th ACM/IEEE Design Automation Conference
Effective jump-pointer prefetching for linked data structures
ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
An inherently parlle large grained data flow environment
CSC '88 Proceedings of the 1988 ACM sixteenth annual conference on Computer science
Intensive Data Management in Parallel Systems: A Survey
Distributed and Parallel Databases
Programming languages and systems for prototyping concurrent applications
ACM Computing Surveys (CSUR)
Compiler-directed shared-memory communication for iterative parallel applications
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Double standards: bringing task parallelism to HPF via the message passing interface
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Communications of the ACM
Integrating SIMD into the undergraduate curriculum
CCSC '01 Proceedings of the sixth annual CCSC northeastern conference on The journal of computing in small colleges
The undecidability of associativity and commutativity analysis
ACM Transactions on Programming Languages and Systems (TOPLAS)
Adaptive data parallel methods for ecosystem monitoring
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Extending the data parallel paradigm with data-dependent operators
Parallel Computing - Parallel computing in image and video processing
Data-Parallel Programming on Multicomputers
IEEE Software
On Computing Complete Histograms of Images in Log (n) Steps Using Hypercubes
IEEE Transactions on Pattern Analysis and Machine Intelligence
Pipelined Data Parallel Algorithms-I: Concept and Modeling
IEEE Transactions on Parallel and Distributed Systems
Data-Parallel Programming on MIMD Computers
IEEE Transactions on Parallel and Distributed Systems
Massively Parallel Algorithms for Trace-Driven Cache Simulations
IEEE Transactions on Parallel and Distributed Systems
The Classification, Fusion, and Parallelization of Array Language Primitives
IEEE Transactions on Parallel and Distributed Systems
Asynchronous Problems on SIMD Parallel Computers
IEEE Transactions on Parallel and Distributed Systems
An Accumulative Parallel Skeleton for All
ESOP '02 Proceedings of the 11th European Symposium on Programming Languages and Systems
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
On the Structure of Concurrent Interpreters
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Adaptive Segmentation of MR Axial Brain Images Using Connected Components
IWVF-4 Proceedings of the 4th International Workshop on Visual Form
SVP: A Model Capturing Sets, Lists, Streams, and Parallelism
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Communication Analysis for Multicomputer Compilers
PACT '94 Proceedings of the IFIP WG10.3 Working Conference on Parallel Architectures and Compilation Techniques
Parallel Compilation on Associative Computers
PACT '94 Proceedings of the IFIP WG10.3 Working Conference on Parallel Architectures and Compilation Techniques
MIMD programs on SIMD architectures
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
Measurement and Modeling of EARTH-MANNA Multithreaded Architecture
MASCOTS '96 Proceedings of the 4th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems
Sourcebook of parallel computing
OOPAL: integrating array programming in object-oriented programming
OOPSLA '03 Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications
Interactive terrain rendering and volume visualization on the Princeton Engine
VIS '92 Proceedings of the 3rd conference on Visualization '92
Language support for data parallelism in pointer based dynamic data structures
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
Parallel functional programming on recursively defined data via data-parallel recursion
Journal of Functional Programming
Two optimal parallel algorithms on the commutation class of a word
Theoretical Computer Science - Words, languages and combinatorics
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 14 - Volume 15
Glift: Generic, efficient, random-access GPU data structures
ACM Transactions on Graphics (TOG)
Interactive summed-area table generation for glossy environmental reflections
SIGGRAPH '05 ACM SIGGRAPH 2005 Sketches
Machine and collection abstractions for user-implemented data-parallel programming
Scientific Programming
A visual environment for specifying global reduction operations
International Journal of High Performance Computing and Networking
Pipelined hash-join on multithreaded architectures
DaMoN '07 Proceedings of the 3rd international workshop on Data management on new hardware
Sparse matrix computations on manycore GPU's
Proceedings of the 45th annual Design Automation Conference
C++ and Massively Parallel Computers
Scientific Programming - The First Annual Object-Oriented Numerics Conference (OON-SKI '93)
Development of Generic Field Classes for Finite Element and Finite Difference Problems
Scientific Programming - The First Annual Object-Oriented Numerics Conference (OON-SKI '93)
A theoretical approach to the use of cyberinfrastructure in geographical analysis
International Journal of Geographical Information Science
Efficient stream compaction on wide SIMD many-core architectures
Proceedings of the Conference on High Performance Graphics 2009
Efficient Multiplication of Polynomials on Graphics Hardware
APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies
Future Generation Computer Systems
The numerical risks of reduction operations in OpenMP
PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
Two-tree algorithms for full bandwidth broadcast, reduction and scan
Parallel Computing
Sort vs. Hash revisited: fast join implementation on modern multi-core CPUs
Proceedings of the VLDB Endowment
Multi-byte Regular Expression Matching with Speculation
RAID '09 Proceedings of the 12th International Symposium on Recent Advances in Intrusion Detection
Fast tridiagonal solvers on the GPU
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Parallel multiclass classification using SVMs on GPUs
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units
SIMD: an additional pattern for PLPP (pattern language for parallel programming)
Proceedings of the 14th Conference on Pattern Languages of Programs
State-of-the-art in heterogeneous computing
Scientific Programming
HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Indexing large-scale raster geospatial data using massively parallel GPGPU computing
Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems
A parallel algorithm for fault simulation on the connection machine
ITC'88 Proceedings of the 1988 international conference on Test: new frontiers in testing
Parallel SAH k-D tree construction
Proceedings of the Conference on High Performance Graphics
Copperhead: compiling an embedded data parallel language
Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
Fast Mersenne prime testing on the GPU
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units
Applying parallel design techniques to template matching with GPUs
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
SAH KD-tree construction on GPU
Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics
Spatial hardware implementation for sparse graph algorithms in GraphStep
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Parallel appearance-adaptive models for real-time object tracking using particle swarm optimization
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Fast updates on read-optimized databases using multi-core CPUs
Proceedings of the VLDB Endowment
Parallel prefix (scan) algorithms for MPI
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Parallel prefix computation in the recursive dual-net
ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Modular resultant algorithm for graphics processors
ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Reasoning about data-parallel pointer programs in a modal extension of separation logic
AMAST'06 Proceedings of the 11th international conference on Algebraic Methodology and Software Technology
Intel's Array Building Blocks: A retargetable, dynamic compiler and embedded language
CGO '11 Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization
Design and implementation of 812: A declarative data-parallel language
Computer Languages
Mealy machines are a better model of lexical analyzers
Computer Languages
Mat-core: a decoupled matrix core extension for general-purpose processors
Neural, Parallel & Scientific Computations
An algorithm for generating t-wise covering arrays from large feature models
Proceedings of the 16th International Software Product Line Conference - Volume 1
Load-balanced isosurfacing on multi-GPU clusters
EG PGV'10 Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization
Data-Parallel Decompression of Triangle Mesh Topology
Computer Graphics Forum
Accelerated parallel genetic programming tree evaluation with OpenCL
Journal of Parallel and Distributed Computing
An optimal parallel prefix-sums algorithm on the memory machine models for GPUs
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Parallel multitask cross validation for Support Vector Machine using GPU
Journal of Parallel and Distributed Computing
Optimizing parallel belief propagation in junction treesusing regression
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
New algorithms for binary jumbled pattern matching
Information Processing Letters
Non-blocking parallel subset construction on shared-memory multicore architectures
AusPDC '13 Proceedings of the Eleventh Australasian Symposium on Parallel and Distributed Computing - Volume 140
Approximate pattern matching with k-mismatches in packed text
Information Processing Letters
Computing resultants on Graphics Processing Units: Towards GPU-accelerated computer algebra
Journal of Parallel and Distributed Computing
Data-parallel finite-state machines
Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
Parallelizing dynamic programming through rank convergence
Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming
Load balanced clustering coefficients
Proceedings of the first workshop on Parallel programming for analytics applications
Artificial Intelligence Review
A Speculative Parallel DFA Membership Test for Multicore, SIMD and Cloud Computing Environments
International Journal of Parallel Programming
Hi-index | 0.08 |
Parallel computers with tens of thousands of processors are typically programmed in a data parallel style, as opposed to the control parallel style used in multiprocessing. The success of data parallel algorithms—even on problems that at first glance seem inherently serial—suggests that this style of programming has much wider applicability than was previously thought.