Tesseract: reconciling guest I/O and hypervisor swapping in a VM

Authors:
Kapil Arya;Yury Baskakov;Alex Garthwaite
Affiliations:
Northeastern University, Boston, MA, USA;VMware, Inc., Cambridge, MA, USA;CloudPhysics, Inc., Hamilton, MA, USA
Venue:
Proceedings of the 10th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Year:
2014

Citing 11
Cited 0

Cellular disco: resource management using virtual clusters on shared-memory multiprocessors

ACM Transactions on Computer Systems (TOCS)
Memory resource management in VMware ESX server

ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Virtual clusters: resource management on large shared-memory multiprocessors

Virtual clusters: resource management on large shared-memory multiprocessors
Geiger: monitoring the buffer cache in a virtual machine environment

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Virtual machine memory access tracing with hypervisor exclusive cache

ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
The double paging anomaly

AFIPS '74 Proceedings of the May 6-10, 1974, national computer conference and exposition
PARDA: proportional allocation of resources for distributed storage access

FAST '09 Proccedings of the 7th conference on File and storage technologies
VM/370: a study of multiplicity and usefulness

IBM Systems Journal
Satori: enlightened page sharing

USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Fast and space-efficient virtual machine checkpointing

Proceedings of the 7th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
VSwapper: a memory swapper for virtualized environments

Proceedings of the 19th international conference on Architectural support for programming languages and operating systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Double-paging is an often-cited, if unsubstantiated, problem in multi-level scheduling of memory between virtual machines (VMs) and the hypervisor. This problem occurs when both a virtualized guest and the hypervisor overcommit their respective physical address-spaces. When the guest pages out memory previously swapped out by the hypervisor, it initiates an expensive sequence of steps causing the contents to be read in from the hypervisor swapfile only to be written out again, significantly lengthening the time to complete the guest I/O request. As a result, performance rapidly drops. We present Tesseract, a system that directly and transparently addresses the double-paging problem. Tesseract tracks when guest and hypervisor I/O operations are redundant and modifies these I/Os to create indirections to existing disk blocks containing the page contents. Although our focus is on reconciling I/Os between the guest disks and hypervisor swap, our technique is general and can reconcile, or deduplicate, I/Os for guest pages read or written by the VM. Deduplication of disk blocks for file contents accessed in a common manner is well-understood. One challenge that our approach faces is that the locality of guest I/Os (reflecting the guest's notion of disk layout) often differs from that of the blocks in the hypervisor swap. This loss of locality through indirection results in significant performance loss on subsequent guest reads. We propose two alternatives to recovering this lost locality, each based on the idea of asynchronously reorganizing the indirected blocks in persistent storage. We evaluate our system and show that it can significantly reduce the costs of double-paging. We focus our experiments on a synthetic benchmark designed to highlight its effects. In our experiments we observe Tesseract can improve our benchmark's throughput by as much as 200% when using traditional disks and by as much as 30% when using SSD. At the same time worst case application responsiveness can be improved by a factor of 5.