VGRIS: virtualized GPU resource isolation and scheduling in cloud gaming

Authors:
Miao Yu;Chao Zhang;Zhengwei Qi;Jianguo Yao;Yin Wang;Haibing Guan
Affiliations:
Shanghai Key Laboratory of Scalable Computing and Systems, School of Software, Shanghai Jiao Tong University, Shanghai, China (Miao Yu, Chao Zhang, Zhengwei Qi, Jianguo Yao, Haibing Guan);HP Labs, Palo Alto, USA (Yin Wang)
Venue:
Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing
Year:
2013

Abstract

Fueled by the maturity of virtualization technology for Graphics Processing Units (GPUs), a growing number of data centers are dedicated to GPU-related computation tasks in cloud gaming. However, GPU resource sharing in these applications is usually poor, because typical cloud gaming service providers allocate one GPU exclusively to one game. To manage computational resources efficiently, cloud platforms need multi-task scheduling techniques that improve GPU utilization. In this paper, we propose VGRIS, a resource management framework for Virtualized GPU Resource Isolation and Scheduling in cloud gaming. By leveraging the mature GPU paravirtualization architecture, VGRIS resides in the host and works through library API interception, so the guest OS and the GPU computing applications remain unmodified. Within this framework, we implement three scheduling algorithms for different objectives: Service Level Agreement (SLA)-aware scheduling, proportional-share scheduling, and hybrid scheduling that mixes the former two. This scheduling framework makes it possible to handle different kinds of GPU computation tasks for different purposes in cloud gaming. Our experimental results show that each scheduling algorithm achieves its goal under various workloads.
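
The three scheduling objectives named in the abstract can be illustrated with a minimal Python sketch. This is not the VGRIS implementation: the abstract does not give the algorithms, so the `VM` record, the frame-pacing rule in `sla_delay_ms`, the weight-based split in `proportional_shares`, and the switching rule in `hybrid_policy` are all hypothetical and serve only to show what each objective could mean in practice.

```python
from dataclasses import dataclass


@dataclass
class VM:
    name: str
    weight: float         # hypothetical proportional-share weight
    frame_cost_ms: float  # hypothetical measured GPU time per frame


def sla_delay_ms(frame_cost_ms: float, target_fps: float) -> float:
    """SLA-aware pacing (assumed rule): delay a frame so the VM runs at
    about target_fps instead of using the GPU as fast as it can."""
    budget_ms = 1000.0 / target_fps
    return max(0.0, budget_ms - frame_cost_ms)


def proportional_shares(vms: list[VM], period_ms: float) -> dict[str, float]:
    """Proportional-share (assumed rule): split one scheduling period of
    GPU time among the VMs in proportion to their weights."""
    total = sum(vm.weight for vm in vms)
    return {vm.name: period_ms * vm.weight / total for vm in vms}


def hybrid_policy(vms: list[VM], target_fps: float) -> str:
    """Hybrid (assumed rule): use SLA-aware pacing while the combined
    per-frame GPU cost fits in one frame budget, otherwise fall back
    to proportional sharing."""
    budget_ms = 1000.0 / target_fps
    total_cost = sum(vm.frame_cost_ms for vm in vms)
    return "sla" if total_cost <= budget_ms else "proportional"


if __name__ == "__main__":
    games = [VM("game_a", 1.0, 10.0), VM("game_b", 3.0, 12.0)]
    print(sla_delay_ms(games[0].frame_cost_ms, target_fps=50.0))
    print(proportional_shares(games, period_ms=100.0))
    print(hybrid_policy(games, target_fps=30.0))
```

In this sketch the SLA-aware rule trades peak frame rate for predictability, while the proportional-share rule guarantees each VM a fixed fraction of GPU time regardless of frame rate. The hybrid rule simply chooses between the two based on load.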