Accelerating two-dimensional page walks for virtualized systems

Authors:
Ravi Bhargava;Benjamin Serebrin;Francesco Spadini;Srilatha Manne
Affiliations:
Advanced Micro Devices, Austin, TX;Advanced Micro Devices, Sunnyvale, CA;Advanced Micro Devices, Austin, TX;Advanced Micro Devices, Bellevue, WA
Venue:
Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
Year:
2008

Citing 10
Cited 42

A new page table for 64-bit address spaces

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
A look at several memory management units, TLB-refill mechanisms, and page table organizations

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Recency-based TLB preloading

Proceedings of the 27th annual international symposium on Computer architecture
Formal requirements for virtualizable third generation architectures

Communications of the ACM
Address space sparsity and fine granularity

EW 6 Proceedings of the 6th workshop on ACM SIGOPS European workshop: Matching operating systems to application needs
Going the distance for TLB prefetching: an application-driven study

ISCA '02 Proceedings of the 29th annual international symposium on Computer architecture
Using SimPoint for accurate and efficient simulation

SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Microarchitecture of HaL's memory management unit

COMPCON '95 Proceedings of the 40th IEEE Computer Society International Conference
A comparison of software and hardware techniques for x86 virtualization

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Virtual Machines: Versatile Platforms for Systems and Processes (The Morgan Kaufmann Series in Computer Architecture and Design)

Virtual Machines: Versatile Platforms for Systems and Processes (The Morgan Kaufmann Series in Computer Architecture and Design)

Virtual machines jailed: virtualization in systems with small trusted computing bases

Proceedings of the 1st EuroSys Workshop on Virtualization Technology for Dependable Systems
NOVA: a microhypervisor-based secure virtualization architecture

Proceedings of the 5th European conference on Computer systems
On the DMA mapping problem in direct device assignment

Proceedings of the 3rd Annual Haifa Experimental Systems Conference
Translation caching: skip, don't walk (the page table)

Proceedings of the 37th annual international symposium on Computer architecture
The evolution of an x86 virtual machine monitor

ACM SIGOPS Operating Systems Review
The turtles project: design and implementation of nested virtualization

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Synergistic TLBs for High Performance Address Translation in Chip Multiprocessors

MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture
Performance profiling of virtual machines

Proceedings of the 7th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Minimal-overhead virtualization of a large scale supercomputer

Proceedings of the 7th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Selective hardware/software memory virtualization

Proceedings of the 7th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Optimizing virtual machines using hybrid virtualization

Proceedings of the 2011 ACM Symposium on Applied Computing
A light-weight virtual machine monitor for Blue Gene/P

Proceedings of the 1st International Workshop on Runtime and Operating Systems for Supercomputers
The best of both worlds with on-demand virtualization

HotOS'13 Proceedings of the 13th USENIX conference on Hot topics in operating systems
Enhancing virtualized application performance through dynamic adaptive paging mode selection

Proceedings of the 8th ACM international conference on Autonomic computing
SpecTLB: a mechanism for speculative address translation

Proceedings of the 38th annual international symposium on Computer architecture
DriverGuard: a fine-grained protection on I/O flows

ESORICS'11 Proceedings of the 16th European conference on Research in computer security
HICAMP: architectural support for efficient concurrency-safe shared structured data access

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
ELI: bare-metal performance for I/O virtualization

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Modeling TCG-Based secure systems with colored petri nets

INTRUST'10 Proceedings of the Second international conference on Trusted Systems
A lightweight virtual machine monitor for Blue Gene/P

International Journal of High Performance Computing Applications
Virtualizing performance counters

Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Analytical bounds for optimal tile size selection

CC'12 Proceedings of the 21st international conference on Compiler Construction
Reducing memory reference energy with opportunistic virtual caching

Proceedings of the 39th Annual International Symposium on Computer Architecture
Revisiting hardware-assisted page walks for virtualized systems

Proceedings of the 39th Annual International Symposium on Computer Architecture
Software techniques for avoiding hardware virtualization exits

USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Optimizing virtual machines using hybrid virtualization

Journal of Systems and Software
Towards exitless and efficient paravirtual I/O

Proceedings of the 5th Annual International Systems and Storage Conference
Lockdown: towards a safe and practical architecture for security applications on commodity platforms

TRUST'12 Proceedings of the 5th international conference on Trust and Trustworthy Computing
Bringing Virtualization to the x86 Architecture with the Original VMware Workstation

ACM Transactions on Computer Systems (TOCS)
Dune: safe user-level access to privileged CPU features

OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
Efficient virtualization on embedded power architecture® platforms

Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems
Efficient live migration of virtual machines using shared storage

Proceedings of the 9th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
CoLT: Coalesced Large-Reach TLBs

MICRO-45 Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
Improving virtualization in the presence of software managed translation lookaside buffers

Proceedings of the 40th Annual International Symposium on Computer Architecture
Efficient virtual memory for big memory servers

Proceedings of the 40th Annual International Symposium on Computer Architecture
An optimized page translation for mobile virtualization

Proceedings of the 50th Annual Design Automation Conference
Lightweight snapshots and system-level backtracking

HotOS'13 Proceedings of the 14th USENIX conference on Hot Topics in Operating Systems
Improving compute node performance using virtualization

International Journal of High Performance Computing Applications
Guide-copy: fast and silent migration of virtual machine for datacenters

SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
COLO: COarse-grained LOck-stepping virtual machines for non-stop service

Proceedings of the 4th annual Symposium on Cloud Computing
Large-reach memory management unit caches

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture
Revisiting memory management on virtualized environments

ACM Transactions on Architecture and Code Optimization (TACO)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Nested paging is a hardware solution for alleviating the software memory management overhead imposed by system virtualization. Nested paging complements existing page walk hardware to form a two-dimensional (2D) page walk, which reduces the need for hypervisor intervention in guest page table management. However, the extra dimension also increases the maximum number of architecturally-required page table references. This paper presents an in-depth examination of the 2D page table walk overhead and options for decreasing it. These options include using the AMD Opteron processor's page walk cache to exploit the strong reuse of page entry references. For a mix of server and SPEC benchmarks, the presented results show a 15%-38% improvement in guest performance by extending the existing page walk cache to also store the nested dimension of the 2D page walk. Caching nested page table translations and skipping multiple page entry references produce an additional 3%-7% improvement. Much of the remaining 2D page walk overhead is due to low-locality nested page entry references, which result in additional memory hierarchy misses. By using large pages, the hypervisor can eliminate many of these long-latency accesses and further improve the guest performance by 3%-22%.