I/O processing in a virtualized platform: a simulation-driven approach

Authors:
Vineet Chadha;Ramesh Illiikkal;Ravi Iyer;Jaideep Moses;Donald Newell;Renato J. Figueiredo
Affiliations:
University of Florida, Gainesville, FL;Intel Corporation, Hillsboro, OR;Intel Corporation, Hillsboro, OR;Intel Corporation, Hillsboro, OR;Intel Corporation, Hillsboro, OR;University of Florida, Gainesville, FL
Venue:
Proceedings of the 3rd international conference on Virtual execution environments
Year:
2007

Citing 12
Cited 8

Simics: A Full System Simulation Platform

Computer
Virtualizing I/O Devices on VMware Workstation's Hosted Virtual Machine Monitor

Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Xen and the art of virtualization

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Diagnosing performance overheads in the xen virtual machine environment

Proceedings of the 1st ACM/USENIX international conference on Virtual execution environments
Guest Editors' Introduction: Resource Virtualization Renaissance

Computer
Virtual Machine Monitors: Current Technology and Future Trends

Computer
Virtual Machines: Versatile Platforms for Systems and Processes (The Morgan Kaufmann Series in Computer Architecture and Design)

Virtual Machines: Versatile Platforms for Systems and Processes (The Morgan Kaufmann Series in Computer Architecture and Design)
Measuring CPU overhead for I/O processing in the Xen virtual machine monitor

ATEC '05 Proceedings of the annual conference on USENIX Annual Technical Conference
Are virtual machine monitors microkernels done right?

HOTOS'05 Proceedings of the 10th conference on Hot Topics in Operating Systems - Volume 10
QoS policies and architecture for cache/memory in CMP platforms

Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Optimizing network virtualization in Xen

ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
High performance VMM-bypass I/O in virtual machines

ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference

Characterization & analysis of a server consolidation benchmark

Proceedings of the fourth ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Towards modeling & analysis of consolidated CMP servers

ACM SIGARCH Computer Architecture News
Does virtualization make disk scheduling passé?

ACM SIGOPS Operating Systems Review
qTLB: looking inside the look-aside buffer

HiPC'07 Proceedings of the 14th international conference on High performance computing
The effect of multi-core on HPC applications in virtualized systems

Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Providing performance guarantees to virtual machines using real-time scheduling

Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
FIOS: a flexible virtualized I/O subsystem to alleviate interference among virtual machines

Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
vSlicer: latency-aware virtual machine scheduling via differentiated-frequency CPU slicing

Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Virtualization provides levels of execution isolation and service partition that are desirable in many usage scenarios, but its associated overheads are a major impediment for wide deployment of virtualized environments. While the virtualization cost depends heavily on workloads, it has been demonstrated that the overhead is much higher with I/O intensive workloads compared to those which are compute-intensive. Unfortunately, the architectural reasons behind the I/O performance overheads are not well understood. Early research in characterizing these penalties has shown that cache misses and TLB related overheads contribute to most of I/O virtualization cost. While most of these evaluations are done using measurements, in this paper we present an execution-driven simulation based analysis methodology with symbol annotation as a means of evaluating the performance of virtualized workloads. This methodology provides detailed information at the architectural level (with a focus on cache and TLB) and allows designers to evaluate potential hardware enhancements to reduce virtualization overhead. We apply this methodology to study the network I/O performance of Xen (as a case study) in a full system simulation environment, using detailed cache and TLB models to profile and characterize software and hardware hotspots. By applying symbol annotation to the instruction flow reported by the execution driven simulator we derive function level call flow information. We follow the anatomy of I/O processing in a virtualized platform for network transmit and receive scenarios and demonstrate the impact of cache scaling and TLB size scaling on performance.