The Cyclone Server Architecture: streamlining delivery of popular content

Authors:
Stanislav Rost;John Byers;Azer Bestavros
Affiliations:
Department of Computer Science, Boston University, Boston, MA, USA;Department of Computer Science, Boston University, Boston, MA, USA;Department of Computer Science, Boston University, Boston, MA, USA
Venue:
Computer Communications
Year:
2002

Citing 20
Cited 0

Efficient dispersal of information for security, load balancing, and fault tolerance

Journal of the ACM (JACM)
Main memory caching of Web documents

Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
Using predictive prefetching to improve World Wide Web latency

ACM SIGCOMM Computer Communication Review
Practical loss-resilient codes

STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
Parity-based loss recovery for reliable multicast transmission

SIGCOMM '97 Proceedings of the ACM SIGCOMM '97 conference on Applications, technologies, architectures, and protocols for computer communication
Effective erasure codes for reliable computer communication protocols

ACM SIGCOMM Computer Communication Review
Self-similarity in World Wide Web traffic: evidence and possible causes

IEEE/ACM Transactions on Networking (TON)
Generating representative Web workloads for network and server performance evaluation

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
A digital fountain approach to reliable distribution of bulk data

Proceedings of the ACM SIGCOMM '98 conference on Applications, technologies, architectures, and protocols for computer communication
IO-lite: a unified I/O buffering and caching system

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Performance issues in WWW servers

SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
An integrated congestion management architecture for Internet hosts

Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
A case for end system multicast (keynote address)

Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Critical path analysis of TCP transactions

Proceedings of the conference on Applications, Technologies, Architectures, and Protocols for Computer Communication
Changes in Web client access patterns: Characteristics and caching implications

World Wide Web
AIDA-based real-time fault-tolerant broadcast disks

RTAS '96 Proceedings of the 2nd IEEE Real-Time Technology and Applications Symposium (RTAS '96)
Application-level document caching in the Internet

SDNE '95 Proceedings of the 2nd International Workshop on Services in Distributed and Networked Environments
Overcast: reliable multicasting with on overlay network

OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Cost-aware WWW proxy caching algorithms

USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Flash: an efficient and portable web server

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference

Quantified Score

Hi-index	0.24

Visualization

Abstract

We propose a new web server architecture optimized for delivery of large, popular files. Delivery of such files currently pose a scalability problem for conventional content providers, which must devote server-side resources in direct proportion to the high multiprogramming level induced by a set of these connections. While use of scalable multicast may remedy this problem some day, multicast is rarely supported in today's wide-area infrastructure. Our approach alleviates many of the most serious scalability problems by developing new server-side mechanisms capable of managing a large set of TCP connections transporting the same content. The strategy we employ relies on the use of fast forward error correcting (FEC) codes to generate encodings of popular content, of which only a small sliding window is cached in memory at any time instant. The concurrent TCP connections then access the content only from this shared window, which is globally useful to all clients. Our method hinges on eliminating unscalable TCP retransmission buffers, as we can ''retransmit'' fresh encoding packets in lieu of the originals with no performance degradation and with no modifications to client TCP stacks. Ultimately, our Cyclone server capitalizes on concurrency to maximize sharing of state across different request threads while minimizing context switching, disk I/O, thrashing under high load and the cache memory footprint. In this paper, we describe the design and prototype implementation of our approach as a Linux kernel subsystem.