Improving file system performance is becoming increasingly important for alleviating the effect of I/O bottlenecks in computer systems. To design changes to an existing file system, or to architect a new one, it is important to understand current usage patterns. In this paper we analyze file I/O traces of several existing production computer systems to understand file access behavior.

Our analysis suggests that a relatively small percentage of the files are active. The total amount of active data is also quite small for interactive environments. An average file encounters a relatively small number of opens while receiving an order of magnitude more reads. An average process opens quite a large number of files over a typical prime-time period. More significantly, outliers dominate many of the characteristics we studied: a relatively small number of processes dominate the activity, and a very small number of files receive most of these operations.

In addition, we provide a comprehensive analysis of the dynamic sharing of files in each of these environments, addressing both simultaneous and sequential sharing and the activity to these shared files. We observe that although only a third of the active files are sequentially shared, they receive a very large proportion of the total operations. We analyze the traces from a given environment across different lengths of time (one-hour, three-hour, and whole-workday intervals) and do this for three different environments. This gives us an idea of the shortest trace length needed to have confidence in the estimation of the parameters.
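
As a rough illustration of the kind of per-file statistics described above, the following minimal sketch summarizes a toy trace. It is not the paper's method: the record layout `(pid, file_id, op)`, the function name `summarize_trace`, and the metric names are all assumptions made for this example, and "shared" here simply means touched by more than one process, without separating simultaneous from sequential sharing.

```python
from collections import Counter, defaultdict


def summarize_trace(records):
    """Summarize a file I/O trace.

    `records` is an iterable of hypothetical (pid, file_id, op) tuples,
    where op is a string such as "open", "read", "write", or "close".
    """
    ops_per_file = Counter()         # all operations seen per file
    opens_per_file = Counter()       # open operations per file
    reads_per_file = Counter()       # read operations per file
    procs_per_file = defaultdict(set)  # distinct processes touching each file

    for pid, file_id, op in records:
        ops_per_file[file_id] += 1
        procs_per_file[file_id].add(pid)
        if op == "open":
            opens_per_file[file_id] += 1
        elif op == "read":
            reads_per_file[file_id] += 1

    if not ops_per_file:
        raise ValueError("trace contains no records")

    total_ops = sum(ops_per_file.values())
    n_files = len(ops_per_file)

    # Assumption: a file counts as shared if more than one process touched it.
    shared = {f for f, procs in procs_per_file.items() if len(procs) > 1}
    shared_ops = sum(ops_per_file[f] for f in shared)

    # Share of all operations that go to the single busiest file (outlier effect).
    _, top_ops = ops_per_file.most_common(1)[0]

    return {
        "active_files": n_files,
        "mean_opens_per_file": sum(opens_per_file.values()) / n_files,
        "mean_reads_per_file": sum(reads_per_file.values()) / n_files,
        "shared_file_fraction": len(shared) / n_files,
        "shared_ops_fraction": shared_ops / total_ops,
        "top_file_ops_fraction": top_ops / total_ops,
    }
```

Running the same function over one-hour, three-hour, and whole-workday slices of a trace and comparing the resulting values would be one simple way to see how the estimates change with trace length.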