Advanced compiler optimizations for supercomputers
Communications of the ACM - Special issue on parallelism
STOC '86 Proceedings of the eighteenth annual ACM symposium on Theory of computing
The Cray X-MP/model 24: a case study in pipelined architecture and vector processing
The Cray X-MP/model 24: a case study in pipelined architecture and vector processing
Scans as Primitive Parallel Operations
IEEE Transactions on Computers
Introduction to algorithms
Logarithmic time cost optimal parallel sorting is not yet fast in practice!
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Scan primitives for vector computers
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
A comparison of sorting algorithms for the connection machine CM-2
SPAA '91 Proceedings of the third annual ACM symposium on Parallel algorithms and architectures
Journal of the ACM (JACM)
Supporting the hypercube programming model on mesh architectures: (a fast sorter for iWarp tori)
SPAA '92 Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures
List ranking and list scan on the Cray C-90
SPAA '94 Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures
Accounting for memory bank contention and delay in high-bandwidth multiprocessors
Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures
From AAPC algorithms to high performance permutation routing and sorting
Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Fast Parallel Sorting Under LogP: Experience with the CM-5
IEEE Transactions on Parallel and Distributed Systems
High-performance sorting on networks of workstations
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Accounting for Memory Bank Contention and Delay in High-Bandwidth Multiprocessors
IEEE Transactions on Parallel and Distributed Systems
Parallel sorting on cache-coherent DSM multiprocessors
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Expressing Irregular Computations in Modern Fortran Dialects
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Performance characteristics of the Cray X1 and their implications for application performance tuning
Proceedings of the 18th annual international conference on Supercomputing
GPUTeraSort: high performance graphics co-processor sorting for large database management
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Irregular computations in Fortran - expression and implementation strategies
Scientific Programming
CellSort: high performance sorting on the cell processor
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Efficient gather and scatter operations on graphics processors
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Relational joins on graphics processors
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
An idiom-finding tool for increasing productivity of accelerators
Proceedings of the international conference on Supercomputing
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Exploring the Tradeoffs between Programmability and Efficiency in Data-Parallel Accelerators
ACM Transactions on Computer Systems (TOCS)
Box-counting algorithm on GPU and multi-core CPU: an OpenCL cross-platform study
The Journal of Supercomputing
Computers and Electrical Engineering
Hi-index | 0.00 |