Many scientific programs exchange large quantities of double-precision data between processing nodes and with mass storage devices. Data compression can reduce the number of bytes that need to be transferred and stored. However, compression is only likely to be employed in high-end computing environments if it does not impede the throughput. This paper describes and evaluates FPC, a fast lossless compression algorithm for linear streams of 64-bit floating-point data. FPC works well on hard-to-compress scientific datasets and meets the throughput demands of high-performance systems. A comparison with five lossless compression schemes, BZIP2, DFCM, FSD, GZIP, and PLMI, on four architectures and thirteen datasets shows that FPC compresses and decompresses one to two orders of magnitude faster than the other algorithms at the same geometric-mean compression ratio. Moreover, FPC provides a guaranteed throughput as long as the prediction tables fit into the L1 data cache. For example, on a 1.6 GHz Itanium 2 server, the throughput is 670 megabytes per second regardless of what data are being compressed.
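The abstract gives no algorithmic detail beyond the use of prediction tables, so the following is only a minimal sketch of the general idea behind predictor-based lossless compression of 64-bit floating-point streams, not FPC itself. It assumes a single hash-indexed table that predicts the next value from recent context, XORs the prediction with the true bit pattern, and stores only the non-zero low-order bytes of the result. The names `TABLE_BITS`, `_next_hash`, `compress`, and `decompress`, the hash function, and the one-byte length header are all hypothetical choices made for illustration.

```python
import struct

# Hypothetical table size: 2**10 entries of 8 bytes each is 8 KiB,
# chosen here only to illustrate a table small enough for an L1 data cache.
TABLE_BITS = 10
TABLE_SIZE = 1 << TABLE_BITS


def _double_to_bits(x):
    """Reinterpret a Python float as its 64-bit IEEE 754 bit pattern."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def _bits_to_double(b):
    """Reinterpret a 64-bit pattern as a Python float."""
    return struct.unpack("<d", struct.pack("<Q", b))[0]


def _next_hash(h, bits):
    """Hypothetical context hash: mix the top 16 bits of the value into h."""
    return ((h << 6) ^ (bits >> 48)) & (TABLE_SIZE - 1)


def compress(values):
    """Encode each double as a 1-byte length plus the non-zero residual bytes."""
    table = [0] * TABLE_SIZE
    h = 0
    out = bytearray()
    for v in values:
        bits = _double_to_bits(v)
        residual = bits ^ table[h]          # zero when the prediction is exact
        table[h] = bits                     # learn the value seen in this context
        h = _next_hash(h, bits)
        nbytes = (residual.bit_length() + 7) // 8   # 0..8 significant bytes
        out.append(nbytes)
        out += residual.to_bytes(nbytes, "big")
    return bytes(out)


def decompress(data, count):
    """Invert compress() by replaying the same table updates."""
    table = [0] * TABLE_SIZE
    h = 0
    pos = 0
    result = []
    for _ in range(count):
        nbytes = data[pos]
        pos += 1
        residual = int.from_bytes(data[pos:pos + nbytes], "big")
        pos += nbytes
        bits = residual ^ table[h]
        table[h] = bits
        h = _next_hash(h, bits)
        result.append(_bits_to_double(bits))
    return result
```

Because the decompressor rebuilds the same table from already-decoded values, no table needs to be transmitted, and the work per value is a fixed handful of operations regardless of the data. A production codec would pack the length header into fewer bits than the full byte used in this sketch.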