The dangers of replication and a solution
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Replication and consistency: being lazy helps sometimes
PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Unifying concurrency control and recovery of transactions with semantically rich operations
Theoretical Computer Science - Special issue: database theory
Update propagation protocols for replicated databates
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A new approach to developing and implementing eager database replication protocols
ACM Transactions on Database Systems (TODS)
Replication strategies in unstructured peer-to-peer networks
Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications
Deferred Updates and Data Placement in Distributed Databases
ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Are quorums an alternative for data replication?
ACM Transactions on Database Systems (TODS)
Adaptive Replication in Peer-to-Peer Systems
ICDCS '04 Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS'04)
Fine-grained replication and scheduling with freshness and correctness guarantees
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Data currency in replicated DHTs
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Web++: a system for fast and reliable web service
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Adaptive searching and replication of images in mobile hierarchical peer-to-peer networks
Data & Knowledge Engineering
International Journal on Digital Libraries
Replicated data management in the grid: the Re:GRIDiT approach
Proceedings of the 1st ACM workshop on Data grids for eScience
Hi-index | 0.01 |
Data Grids are increasingly popular in novel, demanding and data-intensive eScience applications. In these applications, vast amounts of data, generated by specialized instruments, need to be collaboratively accessed, processed and analyzed by a large number of users spread across several organizations. The nearly unlimited storage capabilities of Data Grids allow these data to be replicated at different sites in order to guarantee a high degree of availability. For updateable data objects, several replicas per object need to be maintained in an eager way. In addition, read-only copies serve users' needs of data with different levels of freshness. The number of updateable replicas has to be dynamically adapted to optimize the trade-off between synchronization overhead and the gain which can be achieved by balancing the load of update transactions. Due to the particular characteristics of the Grid, especially due to the absence of a global coordinator, replication management needs to be provided in a completely distributed way. This includes the synchronization of concurrent updates as well as the dynamic deployment and undeployment of replicas based on actual access characteristics which might change over time. In this paper we present the Re:GRIDiT approach to dynamic replica deployment and undeployment in the Grid. Based on a combination of local load statistics, proximity and data access patterns, Re:GRIDiT dynamically adds new replicas or removes existing ones without impacting global correctness. In addition, we provide a detailed evaluation of the overall performance of the dynamic Re:GRIDiT protocol which shows increased throughput with respect to the replication management protocol with a static number of replicas.