The emerging discipline of algorithm engineering has primarily focused on transforming pencil-and-paper sequential algorithms into robust, efficient, well-tested, and easily used implementations. As parallel computing becomes ubiquitous, we need to extend algorithm engineering techniques to parallel computation. Such an extension adds significant complications. After a short review of algorithm engineering achievements for sequential computing, we review the various complications caused by parallel computing, present some examples of successful efforts, and give a personal view of possible future research.