SpinThrift: saving energy in viral workloads

Authors:
Nishanth Sastry;Anthony Hylick;Jon Crowcroft
Affiliations:
University of Cambridge;University of Cambridge;University of Cambridge
Venue:
COMSNETS'10 Proceedings of the 2nd international conference on COMmunication systems and NETworks
Year:
2010

Abstract

This paper looks at optimising the energy cost of data storage when the workload is highly skewed towards a few popular articles that receive a large number of accesses, and when the set of popular articles varies dynamically. A typical example of such a workload is news article access, where the most popular article is heavily accessed, but which article is most popular keeps changing. The properties of dynamically changing popular content are investigated using a trace drawn from a social news web site. It is shown that a) popular articles have a much larger window of interest than non-popular articles, i.e. popular articles typically attract sustained interest rather than a brief surge of interest; and b) popular articles are accessed by multiple unrelated users. In contrast, articles whose accesses spread only virally, i.e. from friend to friend, are shown to have a tendency not to be popular. Using this data, we improve upon Popular Data Concentration (PDC), a technique which saves energy by spinning down disks that do not contain popular data. PDC requires keeping the data ordered by popularity, which involves a significant amount of data migration when the most popular articles keep changing. In contrast, our technique, SpinThrift, detects popular data by the proportion of non-viral accesses made, and results in less data migration, whilst using a similar amount of energy to PDC.
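The idea of detecting popular data by its proportion of non-viral accesses can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract only says that viral accesses spread "from friend to friend", so the rule used here (an access is viral if one of the user's friends accessed the same article earlier), the function names, the friend-graph format, and the 0.5 threshold are all assumptions made for illustration.

```python
from typing import Dict, List, Set


def non_viral_fraction(accessors: List[str], friends: Dict[str, Set[str]]) -> float:
    """Return the fraction of accesses made by users with no friend among
    the earlier accessors of the same article.

    `accessors` lists user ids in access order; `friends` maps a user id
    to the set of that user's friends (hypothetical input format).
    """
    if not accessors:
        return 0.0
    seen: Set[str] = set()
    non_viral = 0
    for user in accessors:
        # Assumed rule: the access is viral if any friend accessed earlier.
        if not (friends.get(user, set()) & seen):
            non_viral += 1
        seen.add(user)
    return non_viral / len(accessors)


def looks_popular(accessors: List[str], friends: Dict[str, Set[str]],
                  threshold: float = 0.5) -> bool:
    """Classify an article as popular when most accesses are non-viral.
    The threshold value is an illustrative assumption, not from the paper."""
    return non_viral_fraction(accessors, friends) >= threshold


# Toy friend graph: alice - bob - carol form a chain; dave and erin are unrelated.
friends = {
    "alice": {"bob"},
    "bob": {"alice", "carol"},
    "carol": {"bob"},
    "dave": set(),
    "erin": set(),
}
viral_story = ["alice", "bob", "carol"]   # spreads friend to friend
broad_story = ["alice", "dave", "erin"]   # accessed by unrelated users

print(looks_popular(viral_story, friends))
print(looks_popular(broad_story, friends))
```

In this toy trace, only the first access to the friend-to-friend story counts as non-viral, so it falls below the threshold, while the story read by unrelated users is classified as popular and would be a candidate for placement on disks that stay spinning.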