No bit left behind: the limits of heap data compression

  • Authors:
  • Jennifer B. Sartor; Martin Hirzel; Kathryn S. McKinley

  • Affiliations:
  • The University of Texas at Austin, Austin, TX, USA; IBM Watson Research Center, Hawthorne, NY, USA; The University of Texas at Austin, Austin, TX, USA

  • Venue:
  • Proceedings of the 7th International Symposium on Memory Management (ISMM '08)
  • Year:
  • 2008


Abstract

On one hand, the high cost of memory continues to drive demand for memory efficiency on embedded and general-purpose computers. On the other hand, programmers are increasingly turning to managed languages like Java for their functionality, programmability, and reliability. Managed languages, however, are not known for their memory efficiency, creating a tension between productivity and performance. This paper examines the sources and types of memory inefficiencies in a set of Java benchmarks. Although prior work has proposed specific heap data compression techniques, each is typically restricted to one model of inefficiency. This paper generalizes and quantitatively compares previously proposed memory-saving approaches and idealized heap compaction. It evaluates a variety of models based on strict and deep object equality, field value equality, removing bytes that are zero, and compressing fields and arrays with a limited number and range of values. The results show that substantial memory reductions are possible in the Java heap. For example, removing bytes that are zero from arrays is particularly effective, reducing the application's memory footprint by 41% on average. We are the first to combine multiple savings models on the heap, which reduces the application's footprint by up to 86%, and by 58% on average. These results demonstrate that future work should be able to combine a high-productivity programming language with memory efficiency.
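To make the zero-byte savings model concrete, the following is a minimal, illustrative Java sketch (not the authors' measurement infrastructure) that estimates how many payload bytes of an int array are zero, and thus how much an idealized zero-byte-removal scheme could reclaim. The class and method names are hypothetical.

```java
import java.nio.ByteBuffer;

/**
 * Illustrative sketch of the "remove bytes that are zero" savings model:
 * count the zero bytes in an array's payload to estimate the space an
 * idealized zero-byte-removal compression could reclaim.
 */
public class ZeroByteModel {

    /** Returns the fraction of payload bytes that are zero for an int array. */
    static double zeroByteFraction(int[] data) {
        ByteBuffer buf = ByteBuffer.allocate(data.length * Integer.BYTES);
        for (int v : data) {
            buf.putInt(v);
        }
        byte[] bytes = buf.array();
        long zeros = 0;
        for (byte b : bytes) {
            if (b == 0) {
                zeros++;
            }
        }
        return bytes.length == 0 ? 0.0 : (double) zeros / bytes.length;
    }

    public static void main(String[] args) {
        // Small values stored in 32-bit slots leave most of their bytes zero,
        // which is why zero-byte removal from arrays is so effective.
        int[] smallValues = {1, 2, 3, 4, 250, 7, 0, 12};
        System.out.printf("zero bytes: %.0f%%%n",
                100 * zeroByteFraction(smallValues));
    }
}
```

In this toy example most bytes are zero because small integers occupy full 32-bit slots; the paper's other models (strict and deep object equality, field value equality, limited value ranges) target analogous redundancy in objects and fields.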