Static and adaptive distributed data replication using genetic algorithms

Authors:
Thanasis Loukopoulos;Ishfaq Ahmad
Affiliations:
Department of Computer Science, The Hong Kong University of Science and Technology, Kowloon, Hong Kong;Department of Computer Science and Engineering, The University of Texas at Arlintgon, P.O. Box 19015, 248 D/E Nedderman Hall, 416 Yates St., Arlington, TX 16019-0015, USA
Venue:
Journal of Parallel and Distributed Computing
Year:
2004

Citing 17
Cited 15

An adaptive data replication algorithm

ACM Transactions on Database Systems (TODS)
Internet Web servers: workload characterization and performance implications

IEEE/ACM Transactions on Networking (TON)
Efficient scheduling of arbitrary task graphs to multiprocessors using a parallel genetic algorithm

Journal of Parallel and Distributed Computing - Special issue on parallel evolutionary computing
Replication, consistency, and practicality: are these mutually exclusive?

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
An iterative approach for rules and data allocation in distributed deductive database systems

Proceedings of the seventh international conference on Information and knowledge management
Web caching and replication

Web caching and replication
Changes in Web client access patterns: Characteristics and caching implications

World Wide Web
WWW Traffic Reduction and Load Balancing through Server-Based Caching

IEEE Parallel & Distributed Technology: Systems & Technology
Multiprocessor Document Allocation: A Genetic Algorithm Approach

IEEE Transactions on Knowledge and Data Engineering
Enhancing the Web's Infrastructure: From Caching to Replication

IEEE Internet Computing
The case for geographical push-caching

HOTOS '95 Proceedings of the Fifth Workshop on Hot Topics in Operating Systems (HotOS-V)
Replicating the Contents of a WWW Multimedia Repository to Minimize Download Time

IPDPS '00 Proceedings of the 14th International Symposium on Parallel and Distributed Processing
WebWave: Globally Load Balanced Fully Distributed Caching of Hot Published Documents

ICDCS '97 Proceedings of the 17th International Conference on Distributed Computing Systems (ICDCS '97)
Static and Adaptive Data Replication Algorithms for Fast Information Access in Large Distributed Systems

ICDCS '00 Proceedings of the The 20th International Conference on Distributed Computing Systems ( ICDCS 2000)
A Dynamic Object Replication and Migration Protocol for an Internet Hosting Service

ICDCS '99 Proceedings of the 19th IEEE International Conference on Distributed Computing Systems
Optimal File Allocation in a Multiple Computer System

IEEE Transactions on Computers
Design and evaluation of data allocation algorithms for distributed multimedia database systems

IEEE Journal on Selected Areas in Communications

A Powerful Direct Mechanism for Optimal WWW Content Replication

IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
A Pure Nash Equilibrium Guaranteeing Game Theoretical Replica Allocation Method for Reducing Web Access Time

ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
Distributed Selfish Replication

IEEE Transactions on Parallel and Distributed Systems
Comparison and analysis of ten static heuristics-based Internet data replication techniques

Journal of Parallel and Distributed Computing
A policy of conflict negotiation based on fuzzy matter element particle swarm optimization in distributed collaborative creative design

Computer-Aided Design
Applying genetic algorithms to decision making in autonomic computing systems

ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Replicating web contents using a hybrid particle swarm optimization

Information Processing and Management: an International Journal
Replicating data objects in large distributed database systems: an axiomatic game theoretic mechanism design approach

Distributed and Parallel Databases
Plato: a genetic algorithm approach to run-time reconfiguration in autonomic computing systems

Cluster Computing
Review: A survey on content-centric technologies for the current Internet: CDN and P2P solutions

Computer Communications
A history-based scheduler for dynamic load balancing on distributed VOD server environments

ICCSA'10 Proceedings of the 2010 international conference on Computational Science and Its Applications - Volume Part III
An evolutionary approach to schema partitioning selection in a data warehouse

DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
Local utility aware content replication

NETWORKING'05 Proceedings of the 4th IFIP-TC6 international conference on Networking Technologies, Services, and Protocols; Performance of Computer and Communication Networks; Mobile and Wireless Communication Systems
A novel dynamic network data replication scheme based on historical access record and proactive deletion

The Journal of Supercomputing
GCplace: geo-cloud based correlation aware data replica placement

Proceedings of the 28th Annual ACM Symposium on Applied Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Fast dissemination and access of information in large distributed systems, such as the Internet, has become a norm of our daily life. However, undesired long delays experienced by end-users, especially during the peak hours, continue to be a common problem. Replicating some of the objects at multiple sites is one possible solution in decreasing network traffic. The decision of what to replicate where, requires solving a constraint optimization problem which is NP-complete in general. Such problems are known to stretch the capacity of a Genetic Algorithm (GA) to its limits. Nevertheless, we propose a GA to solve the problem when the read/write demands remain static and experimentally prove the superior solution quality obtained compared to an intuitive greedy method. Unfortunately, the static GA approach involves high running time and may not be useful when read/write demands continuously change, as is the case with breaking news. To tackle such case we propose a hybrid GA that takes as input the current replica distribution and computes a new one using knowledge about the network attributes and the changes occurred. Keeping in view more pragmatic scenarios in today's distributed information environments, we evaluate these algorithms with respect to the storage capacity constraint of each site as well as variations in the popularity of objects, and also examine the trade-off between running time and solution quality.