Reducing the Storage Burden via Data Deduplication

Authors:
David Geer
Affiliations:
-
Venue:
Computer
Year:
2008

Citing 0
Cited 3

On compressing the textual web

Proceedings of the third ACM international conference on Web search and data mining
A running time improvement for the two thresholds two divisors algorithm

Proceedings of the 48th Annual Southeast Regional Conference
Creating knowledge out of interlinked data: making the web a data washing machine

Proceedings of the International Conference on Web Intelligence, Mining and Semantics

Quantified Score

Hi-index	4.10

Visualization

Abstract

Companies are collecting and storing huge amounts of data, much of it redundant. Many organizations are turning to data deduplication to reduce these huge information volumes, as well as the equipment and operational costs they entail.