A five-year study of file-system metadata
ACM Transactions on Storage (TOS)
GreenFS: making enterprise computers greener by protecting them better
Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems 2008
Proceedings of the 2008 ACM symposium on Applied computing
A nine year study of file system and storage benchmarking
ACM Transactions on Storage (TOS)
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Measurement and analysis of large-scale network file system workloads
ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
SCAN-Lite: enterprise-wide analysis on the cheap
Proceedings of the 4th ACM European conference on Computer systems
The case of the fake Picasso: preventing history forgery with secure provenance
FAST '09 Proccedings of the 7th conference on File and storage technologies
Generating realistic impressions for file-system benchmarking
FAST '09 Proccedings of the 7th conference on File and storage technologies
Spyglass: fast, scalable metadata search for large-scale storage systems
FAST '09 Proccedings of the 7th conference on File and storage technologies
Story book: an efficient extensible provenance framework
TAPP'09 First workshop on on Theory and practice of provenance
A multiple-file write scheme for improving write performance of small files in Fast File System
Information Processing Letters
Preventing history forgery with secure provenance
ACM Transactions on Storage (TOS)
Generating realistic impressions for file-system benchmarking
ACM Transactions on Storage (TOS)
TCP revisited: a fresh look at TCP in the wild
Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
MHS: A distributed metadata management strategy
Journal of Systems and Software
SQCK: a declarative file system checker
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
TrapperKeeper: the case for using virtualization to add type awareness to file systems
HotStorage'10 Proceedings of the 2nd USENIX conference on Hot topics in storage and file systems
On securing untrusted clouds with cryptography
Proceedings of the 9th annual ACM workshop on Privacy in the electronic society
CacheCard: caching static and dynamic content on the NIC
Proceedings of the 5th ACM/IEEE Symposium on Architectures for Networking and Communications Systems
Fast and secure laptop backups with encrypted de-duplication
LISA'10 Proceedings of the 24th international conference on Large installation system administration
A study of practical deduplication
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
Scale and concurrency of GIGA+: file system directories with millions of files
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
Emulating Goliath storage systems with David
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
Just-in-time analytics on large file systems
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
Design implications for enterprise storage systems via multi-dimensional trace analysis
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
A file is not a file: understanding the I/O behavior of Apple desktop applications
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Towards reliable storage systems
Towards reliable storage systems
Emulating goliath storage systems with David
ACM Transactions on Storage (TOS)
A study of practical deduplication
ACM Transactions on Storage (TOS)
HPDA: A hybrid parity-based disk array for enhanced performance and reliability
ACM Transactions on Storage (TOS)
Robust benchmarking for archival storage tiers
Proceedings of the sixth workshop on Parallel Data Storage
Analysis of Workload Behavior in Scientific and Historical Long-Term Data Repositories
ACM Transactions on Storage (TOS)
Characteristics of backup workloads in production systems
FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
A File Is Not a File: Understanding the I/O Behavior of Apple Desktop Applications
ACM Transactions on Computer Systems (TOCS)
Generating realistic datasets for deduplication analysis
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Benchmarking and modeling disk-based storage tiers for practical storage design
ACM SIGMETRICS Performance Evaluation Review
Usage behavior of a large-scale scientific archive
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Examining extended and scientific metadata for scalable index designs
Proceedings of the 6th International Systems and Storage Conference
SAFE: A Source Deduplication Framework for Efficient Cloud Backup Services
Journal of Signal Processing Systems
Building workload-independent storage with VT-trees
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
Virtual machine workloads: the case for new benchmarks for NAS
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
(Big)data in a virtualized world: volume, velocity, and variety in cloud datacenters
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Hi-index | 0.00 |
For five years, we collected annual snapshots of filesystem metadata from over 60,000 Windows PC file systems in a large corporation. In this paper, we use these snapshots to study temporal changes in file size, file age, file-type frequency, directory size, namespace structure, file-system population, storage capacity and consumption, and degree of file modification. We present a generative model that explains the namespace structure and the distribution of directory sizes. We find significant temporal trends relating to the popularity of certain file types, the origin of file content, the way the namespace is used, and the degree of variation among file systems, as well as more pedestrian changes in sizes and capacities. We give examples of consequent lessons for designers of file systems and related software.