Design and Implementation of a Predictive File Prefetching Algorithm
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Using Multiple Predictors to Improve the Accuracy of File Access Predictions
MSS '03 Proceedings of the 20 th IEEE/11 th NASA Goddard Conference on Mass Storage Systems and Technologies (MSS'03)
An information-interconnectivity-based retrieval method for network attached storage
Proceedings of the 1st conference on Computing frontiers
Intelligent file management in ubiquitous environments
Proceedings of the 2005 ACM symposium on Applied computing
STEP: Self-Tuning Energy-safe Predictors
Proceedings of the 6th international conference on Mobile data management
Data Staging on Untrusted Surrogates
FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Expecting the unexpected: adaptation for predictive energy conservation
Proceedings of the 2005 ACM workshop on Storage security and survivability
A stochastic approach to file access prediction
SNAPI '03 Proceedings of the international workshop on Storage network architecture and parallel I/Os
A performance-oriented energy efficient file system
SNAPI '04 Proceedings of the international workshop on Storage network architecture and parallel I/Os
Path and cache conscious prefetching (PCCP)
The VLDB Journal — The International Journal on Very Large Data Bases
Energy-efficient disk replacement and file placement techniques for mobile systems with hard disks
Proceedings of the 2007 ACM symposium on Applied computing
Towards higher disk head utilization: extracting free bandwidth from busy disk drives
OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
ACM Transactions on Storage (TOS)
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Proceedings of the 4th Annual International Conference on Wireless Internet
Enhancing prediction accuracy in PCM-based file prefetch by constained pattern replacement algorithm
ICCS'03 Proceedings of the 2003 international conference on Computational science
File access prediction using neural networks
IEEE Transactions on Neural Networks
Reducing seek overhead with application-directed prefetching
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Freeblock scheduling outside of disk firmware
FAST'02 Proceedings of the 1st USENIX conference on File and storage technologies
Data staging on untrusted surrogates
FAST'03 Proceedings of the 2nd USENIX conference on File and storage technologies
PDDRA: A new pre-fetching based dynamic data replication algorithm in data grids
Future Generation Computer Systems
Autonomic storage system based on automatic learning
HiPC'04 Proceedings of the 11th international conference on High Performance Computing
Effects on performance and energy reduction by file relocation based on file-access correlations
Proceedings of the 2012 Joint EDBT/ICDT Workshops
Hi-index | 0.00 |
Most modern I/O systems treat each file access independently. However, events in a computer system are driven by programs. Thus, accesses to files occur in consistent patterns and are by no means independent. The result is that modern I/O systems ignore useful information. Using traces of file system activity we show that file accesses are strongly correlated with preceding accesses. In fact, a simple last-successor model (one that predicts each file access will be followed by the same file that followed the last time it was accessed) successfully predicted the next file 72% of the time. We examine the ability of two previously proposed models for file access prediction in comparison to this baseline model and see a stark contrast in accuracy and high overheads in state space. We then enhance one of these models to address the issues of model space requirements. This new model is able to improve an additional 10% on the accuracy of the last-successor model, while working within a state space that is within a constant factor (relative to the number of files) of the last-successor model. While this work was motivated by the use of file relationships for I/O prefetching, information regarding the likelihood of file access patterns has several other uses such as disk layout and file clustering for disconnected operation.