Brief announcement: TeraByte TokuSampleSort sorts 1TB in 197s

  • Authors:
  • Bradley C. Kuszmaul

  • Affiliations:
  • MIT, Cambridge, MA, USA

  • Venue:
  • Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
  • Year:
  • 2009

Abstract

The tx2500 disk cluster at MIT Lincoln Laboratory sorted a terabyte (10^10 100-byte records) in 197s using an "Indy" sort, and in 297s using a "Daytona" sort. The sort used a parallel sample sort and ran on 400 nodes, each with a 6-disk RAID and 8GB of memory, all connected by Infiniband. The nodes communicated over TCP sockets.
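
The abstract names the algorithm but does not describe it. As a rough illustration of the general sample sort technique only, and not of the author's implementation, the following single-process Python sketch treats each bucket as a stand-in for one node. The function name `sample_sort` and the parameters `num_buckets`, `oversample`, and `seed` are assumptions made for this example; a cluster version would replace the routing loop with an all-to-all exchange over the network and would read and write records on disk.

```python
import bisect
import random


def sample_sort(records, num_buckets, oversample=8, seed=0):
    """Single-process sketch of sample sort (hypothetical helper, not the paper's code).

    Each bucket plays the role of one node in a parallel run.
    """
    if num_buckets < 2 or len(records) < 2:
        return sorted(records)

    rng = random.Random(seed)

    # 1. Draw a random sample of the input and sort it.
    sample_size = min(len(records), num_buckets * oversample)
    sample = sorted(rng.sample(records, sample_size))

    # 2. Choose num_buckets - 1 evenly spaced splitters from the sorted sample.
    splitters = [sample[(i * sample_size) // num_buckets]
                 for i in range(1, num_buckets)]

    # 3. Route every record to the bucket whose key range contains it.
    #    In a cluster this step is the all-to-all data exchange.
    buckets = [[] for _ in range(num_buckets)]
    for record in records:
        buckets[bisect.bisect_right(splitters, record)].append(record)

    # 4. Sort each bucket independently; concatenating the buckets in
    #    order then yields the fully sorted output.
    result = []
    for bucket in buckets:
        bucket.sort()
        result.extend(bucket)
    return result


if __name__ == "__main__":
    rng = random.Random(1)
    data = [rng.randrange(10**6) for _ in range(10_000)]
    assert sample_sort(data, num_buckets=16) == sorted(data)
```

Oversampling (drawing several sample records per bucket) is a common way to make the splitters divide the input more evenly, so that no single bucket receives far more data than the others. Whether and how the reported runs oversampled is not stated in the abstract.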