The design and implementation of a log-structured file system
SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Measurements of a distributed file system
SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
On the self-similar nature of Ethernet traffic (extended version)
IEEE/ACM Transactions on Networking (TON)
A quantitative analysis of cache policies for scalable network file systems
SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Wide-area traffic: the failure of Poisson modeling
SIGCOMM '94 Proceedings of the conference on Communications architectures, protocols and applications
SIGCOMM '95 Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
The HP AutoRAID hierarchical storage system
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
A trace-driven analysis of the UNIX 4.2 BSD file system
Proceedings of the tenth ACM symposium on Operating systems principles
Self-Similar ("Fractal") Traffic in ATM Networks
IWACA '94 Proceedings of the Second International Workshop on Multimedia: Advanced Teleservices and High-Speed Communication Architectures
Explaining World Wide Web Traffic Self-Similarity
Explaining World Wide Web Traffic Self-Similarity
Characteristics of File System Workloads
Characteristics of File System Workloads
A large-scale study of file-system contents
SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
File system usage in Windows NT 4.0
Proceedings of the seventeenth ACM symposium on Operating systems principles
Architectural considerations for next generation file systems
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Implications of proxy caching for provisioning networks and servers
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
A disstributed backoff algorithm to support real-time traffic on ethernet
ACM SIGOPS Operating Systems Review
Bandwidth allocation in a self-managing multimedia file server
MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
On the impact of workload burstiness on disk performance
Workload characterization of emerging computer applications
Capturing the spatio-temporal behavior of real traffic data
Performance Evaluation
Comparing Logs and Models of Parallel Workloads Using the Co-plot Method
IPPS/SPDP '99/JSSPP '99 Proceedings of the Job Scheduling Strategies for Parallel Processing
Performance Evaluation with Heavy Tailed Distributions
JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Workload Characterization Issues and Methodologies
Performance Evaluation: Origins and Directions
Workload Modeling for Performance Evaluation
Performance Evaluation of Complex Systems: Techniques and Tools, Performance 2002, Tutorial Lectures
Performance Evaluation with Heavy Tailed Distributions
TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Architectural considerations for next-generation file systems
Multimedia Systems
ADMiRe: an algebraic approach to system performance analysis using data mining techniques
Proceedings of the 2003 ACM symposium on Applied computing
Grid resource management
Characteristics of I/O traffic in personal computer and server workloads
IBM Systems Journal
Hierarchical Dynamics, Interarrival Times, and Performance
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
ADMiRe: An Algebraic Data Mining Approach to System Performance Analysis
IEEE Transactions on Knowledge and Data Engineering
Aqueduct: Online Data Migration with Performance Guarantees
FAST '02 Proceedings of the 1st USENIX Conference on File and Storage Technologies
CEFT: A cost-effective, fault-tolerant parallel virtual file system
Journal of Parallel and Distributed Computing
A comparison of file system workloads
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Performance impacts of autocorrelated flows in multi-tiered systems
Performance Evaluation
A five-year study of file-system metadata
ACM Transactions on Storage (TOS)
Flight data recorder: monitoring persistent-state interactions to improve systems management
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
Evaluating block-level optimization through the IO path
ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
Measurement and analysis of large-scale network file system workloads
ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Execution context optimization for disk energy
CASES '08 Proceedings of the 2008 international conference on Compilers, architectures and synthesis for embedded systems
Generating realistic impressions for file-system benchmarking
FAST '09 Proccedings of the 7th conference on File and storage technologies
Capture, conversion, and analysis of an intense NFS workload
FAST '09 Proccedings of the 7th conference on File and storage technologies
RTG: a recursive realistic graph generator using random typing
Data Mining and Knowledge Discovery
RTG: A Recursive Realistic Graph Generator Using Random Typing
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Higher reliability redundant disk arrays: Organization, operation, and coding
ACM Transactions on Storage (TOS)
Generating realistic impressions for file-system benchmarking
ACM Transactions on Storage (TOS)
System support for scalable and fault tolerant internet services
Middleware '98 Proceedings of the IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing
ACM Transactions on Sensor Networks (TOSN)
Self-similarity in SPLASH-2 workloads on shared memory multiprocessors systems
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
Proceedings of the sixth conference on Computer systems
Self-similarity: Behind workload reshaping and prediction
Future Generation Computer Systems
Design implications for enterprise storage systems via multi-dimensional trace analysis
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
WINE'05 Proceedings of the First international conference on Internet and Network Economics
Why simple timeout strategies work perfectly in practice?
ICESS'04 Proceedings of the First international conference on Embedded Software and Systems
Detecting data theft using stochastic forensics
Digital Investigation: The International Journal of Digital Forensics & Incident Response
Fuzzy adaptive control for heterogeneous tasks in high-performance storage systems
Proceedings of the 6th International Systems and Storage Conference
DupLESS: server-aided encryption for deduplicated storage
SEC'13 Proceedings of the 22nd USENIX conference on Security
(Big)data in a virtualized world: volume, velocity, and variety in cloud datacenters
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Hi-index | 0.00 |
We demonstrate that high-level file system events exhibit self-similar behaviour, but only for short-term time scales of approximately under a day. We do so through the analysis of four sets of traces that span time scales of milliseconds through months, and that differ in the trace collection method, the filesystems being traced, and the chronological times of the tracing. Two sets of detailed, short-term file system trace data are analyzed; both are shown to have self-similar like behaviour, with consistent Hurst parameters (a measure of self-similarity) for all file system traffic as well as individual classes of file system events. Long-term file system trace data is then analyzed, and we discover that the traces' high variability and self-similar behaviour does not persist across time scales of days, weeks, and months. Using the short-term trace data, we show that sources of file system traffic exhibit ON/OFF source behaviour, which is characterized by highly variably lengthed bursts of activity, followed by similarly variably lengthed periods of inactivity. This ON/OFF behaviour is used to motivate a simple technique for synthesizing a stream of events that exhibit the same self-similar short-term behaviour as was observed in the file system traces.