Scalable hashing for shared memory supercomputers

Authors:
Eric Goodman;M. Nicole Lemaster;Edward Jimenez
Affiliations:
Sandia National Laboratories, Albuquerque, NM;Sandia National Laboratories, Livermore, CA;Sandia National Laboratories, Albuquerque, NM
Venue:
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Year:
2011

Citing 12
Cited 0

Transactional memory: architectural support for lock-free data structures

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Graph structure in the Web

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
High performance dynamic lock-free hash tables and list-based sets

Proceedings of the fourteenth annual ACM symposium on Parallel algorithms and architectures
Lock-free dynamic hash tables with open addressing

Distributed Computing - Special issue: PODC 02
Split-ordered lists: Lock-free extensible hash tables

Journal of the ACM (JACM)
A general lock-free algorithm using compare-and-swap

Information and Computation
Concurrent programming without locks

ACM Transactions on Computer Systems (TOCS)
Interpreting the data: Parallel analysis with Sawzall

Scientific Programming - Dynamic Grids and Worldwide Computing
MapReduce: simplified data processing on large clusters

Communications of the ACM - 50th anniversary issue: 1958 - 2008
Hopscotch Hashing

DISC '08 Proceedings of the 22nd international symposium on Distributed Computing
Lock-free parallel dynamic programming

Journal of Parallel and Distributed Computing
A fast, lock-free approach for efficient parallel counting of occurrences of k-mers

Bioinformatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Hashing is a fundamental technique in computer science to allow O(1) insert and lookups of items in an associative array. Here we present several thread coordination and hashing strategies and compare and contrast their performance on large, shared memory symmetric multiprocessor machines, each possessing between a half to a full terabyte of memory. We show how our approach can be used as a key kernel for fundamental paradigms such as dynamic programming and MapReduce. We further show that a set of approaches yields close to linear speedup for both uniform random and more difficult power law distributions. This scalable performance is in spite of the fact that our set of approaches is not completely lock-free. Our experimental results utilize and compare an SGI Altix UV with 4 Xeon processors (32 cores) and a Cray XMT with 128 processors. On the scale of data we addressed, on the order of 5 billion integers, we show that the Altix UV far exceeds the performance of the Cray XMT for power law distributions. However, the Cray XMT exhibits greater scalability.