FPGASort: a high performance sorting architecture exploiting run-time reconfiguration on fpgas for large problem sorting

Authors:
Dirk Koch;Jim Torresen
Affiliations:
University of Oslo, Oslo, Norway;University of Oslo, Oslo, Norway
Venue:
Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays
Year:
2011

Citing 8
Cited 5

Tradeoff Analysis and Architecture Design of a Hybrid Hardware/Software Sorter

ASAP '00 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures, and Processors
Implementing sorting in database systems

ACM Computing Surveys (CSUR)
GPUTeraSort: high performance graphics co-processor sorting for large database management

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Scan primitives for GPU computing

Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware
CellSort: high performance sorting on the cell processor

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Sorting networks and their applications

AFIPS '68 (Spring) Proceedings of the April 30--May 2, 1968, spring joint computer conference
Data processing on FPGAs

Proceedings of the VLDB Endowment
Towards rapid dynamic partial reconfiguration in video-based driver assistance systems

ARC'10 Proceedings of the 6th international conference on Reconfigurable Computing: architectures, Tools and Applications

Automating resource optimisation in reconfigurable design (abstract only)

Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays
Less watts, more performance: an intelligent storage engine for data appliances

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Optimization of address-based data sorting unit with external memory support

Proceedings of the 14th International Conference on Computer Systems and Technologies
Combining computation and communication optimizations in system synthesis for streaming applications

Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays
Hardware acceleration of database operations

Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper analyses different hardware sorting architectures in order to implement a highly scaleable sorter for solving huge problems at high performance up to the GB range in linear time complexity. It will be proven that a combination of a FIFO-based merge sorter and a tree-based merge sorter results in the best performance at low cost. Moreover, we will demonstrate how partial run-time reconfiguration can be used for saving almost half the FPGA resources or alternatively for improving the speed. Experiments show a sustainable sorting throughput of 2GB/s for problems fitting into the on-chip FPGA memory and 1 GB/s when using external memory. These values surpass the best published results on large problem sorting implementations on FPGAs, GPUs, and the Cell processor.