PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth
Proceedings of the 17th International Conference on Data Engineering
Group-Based Management of Distributed File Caches
ICDCS '02 Proceedings of the 22 nd International Conference on Distributed Computing Systems (ICDCS'02)
The Panasas ActiveScale Storage Cluster: Delivering Scalable High Bandwidth Storage
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Hi-index | 0.00 |
Capturing the advantages of NAS and SAN, Object-based Storage System (OBSS) is becoming an active research area. OBSS is composed of clients, metadata server, and Object-based Storage Devices (OSDs). Similar to the prefetching of file and block, object prefetching is also an effective method to reduce the object access latency in OSD. However, the existing methods focus on intra-object prefetching [1] and they limit the prefetching capability among objects to a certain extent. In this paper, we present a novel inter-object prefetching approach which considers the characteristics of the object access in OSD. This approach adopts a kind of data mining technique called sequential pattern mining to find the object access sequential patterns [2], then produces the object sequential rules according to the patterns, and finally prefetches objects according to the rules and the object access context. The whole process includes the following three modules: Object Request Service Module. It maintains a current object access context, which is a l-length recently accessed object sequence. Furthermore, it appends new object information to the historic object access stream when an object is accessed. Sequential Rules Mining Module. It is composed of the object sequences preprocessing, the sequential pattern mining and the sequential rules producing modules. The preprocessing module divides the object access stream into many subsequences. Base on the discovered object sequential patterns gained by the sequential pattern mining module, object access sequential rules table is constructed. Prefetching Engine Module. When the current requested object is not in the object cache, the prefetching engine selects a matched rule with the highest confidence from the rules table according to the context, then chooses the corresponding objects and preloads them into the object cache. Simulated experimental results show that both prediction accuracy and object hit rate can be improved comparing with the Group-based prefetching method [3] which can also be used for object prefetching.