Parallel failure recovery techniques in cluster-based media servers

Authors:
Joahyung Lee;Inbum Jung
Affiliations:
Department of Computer Science and Engineering, Kangwon National University, Chuncheon, Korea;Department of Computer Science and Engineering, Kangwon National University, Chuncheon, Korea
Venue:
The Journal of Supercomputing
Year:
2010

Citing 14
Cited 0

Architectures and algorithms for on-line failure recovery in redundant disk arrays

Distributed and Parallel Databases - Special issue on disk arrays
Distributed schedule management in the Tiger video fileserver

Proceedings of the sixteenth ACM symposium on Operating systems principles
The interactive performance of SLIM: a stateless, thin-client architecture

Proceedings of the seventeenth ACM symposium on Operating systems principles
Multimedia servers: applications, environments, and design

Multimedia servers: applications, environments, and design
Modeling and Performance Comparison of Reliability Strategies for Distributed Video Servers

IEEE Transactions on Parallel and Distributed Systems
Data striping and reliability aspects in distributed video servers

Cluster Computing
Analytic Modeling and Comparisons of Striping Strategies for Replicated Disk Arrays

IEEE Transactions on Computers
Automatic Generation of Availability Models in RAScad

DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Data Storage and Retrieval for Video-on-Demand Servers

MSE '02 Proceedings of the Fourth IEEE International Symposium on Multimedia Software Engineering
A Multicast Delivery Scheme for VCR Operations in a Large VOD System

ICPADS '01 Proceedings of the Eighth International Conference on Parallel and Distributed Systems
Modeling the Caching Effect in Continuous Media Servers

Multimedia Tools and Applications
Caching and Scheduling in NAD-Based Multimedia Servers

IEEE Transactions on Parallel and Distributed Systems
Guest Editors' Introduction: Approaches to Recovery-Oriented Computing

IEEE Internet Computing
Resource consumption-aware QoS in cluster-based VOD servers

Journal of Systems Architecture: the EUROMICRO Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

For large-scale video-on-demand (VOD) service, cluster servers are highlighted due to their high performance and low cost. A cluster server consists of a front-end node and multiple backend nodes. Though the increase in backend nodes provides more quality of service (QoS) streams, the possibility of backend node failure is proportionally increased. The failure causes not only the cessation of streaming services but also the loss of current playing positions. In this paper, when a backend node fails, recovery mechanisms are studied to support the streaming service continuously. Without considering the characteristics of cluster-based servers and MPEG media, the basic redundant array of independent disks (RAID) techniques cause a network bottleneck in the internal network path and demonstrate inefficient CPU usage in backend nodes. To address these problems, a new failure recovery mechanism is proposed based on the pipeline computing concept. The proposed method not only distributes the internal network traffic generated from the recovery operations but also utilizes the CPU time available in the backend nodes. In the experiments, even if a backend node fails, the proposed method provides continuous streaming media services within a short MTTR value as well as more QoS streams than the existing method.