The Multi-Queue Replacement Algorithm for Second Level Buffer Caches
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Massive arrays of idle disks for storage archives
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Energy conservation techniques for disk array-based servers
Proceedings of the 18th annual international conference on Supercomputing
Interplay of energy and performance for disk arrays running transaction processing workloads
ISPASS '03 Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software
The dynamics of viral marketing
ACM Transactions on the Web (TWEB)
Power provisioning for a warehouse-sized computer
Proceedings of the 34th annual international symposium on Computer architecture
I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system
Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
Dependability, access diversity, low cost: pick two
HotDep'07 Proceedings of the 3rd workshop on on Hot Topics in System Dependability
Optimizing power consumption in large scale storage systems
HOTOS'07 Proceedings of the 11th USENIX workshop on Hot topics in operating systems
Write off-loading: Practical power management for enterprise storage
ACM Transactions on Storage (TOS)
A measurement-driven analysis of information propagation in the flickr social network
Proceedings of the 18th international conference on World wide web
Buzztraq: predicting geographical access patterns of social cascades using social networks
Proceedings of the Second ACM EuroSys Workshop on Social Network Systems
Cutting the electric bill for internet-scale systems
Proceedings of the ACM SIGCOMM 2009 conference on Data communication
Hi-index | 0.00 |
This paper looks at optimising the energy costs for data storage when the work load is highly skewed by a large number of accesses from a few popular articles, but whose popularity varies dynamically. A typical example of such a work load is news article access, where the most popular is highly accessed, but which article is most popular keeps changing. The properties of dynamically changing popular content are investigated using a trace drawn from a social news web site. It is shown that a) popular content have a much larger window of interest than non-popular articles. i.e. popular articles typically have a more sustained interest rather than a brief surge of interest. b) popular content are accessed by multiple unrelated users. In contrast, articles whose accesses spread only virally, i.e. from friend to friend, are shown to have a tendency not to be popular. Using this data, we improve upon Popular Data Concentration (PDC), a technique which is used to save energy by spinning down disks that do not contain popular data. PDC requires keeping the data ordered by their popularity, which involves significant amount of data migration, when the most popular articles keep changing. In contrast, our technique, SpinThrift, detects popular data by the proportion of non-viral accesses made, and results in lesser data migration, whilst using a similar amount of energy as PDC.