RCS—a system for version control
Software—Practice & Experience
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
A vector space model for automatic indexing
Communications of the ACM
Modern Information Retrieval
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
A foundation for vacuuming temporal databases
Data & Knowledge Engineering
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
The Design of the POSTGRES Storage System
VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
Change-Centric Management of Versions in an XML Warehouse
Proceedings of the 27th International Conference on Very Large Data Bases
Algorithms for Temporal Query Operators in XML Databases
EDBT '02 Proceedings of the Worshops XMLDM, MDDE, and YRWS on XML-Based Data Management and Multimedia Engineering-Revised Papers
Detecting Changes in XML Documents
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Specification-Based Data Reduction in Dimensional Data Warehouses
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Expiration of Historical Databases
TIME '01 Proceedings of the Eighth International Symposium on Temporal Representation and Reasoning (TIME'01)
Supporting temporal text-containment queries in temporal document databases
Data & Knowledge Engineering
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
A rule-based tool for gradual granular data aggregation
Proceedings of the ACM 14th international workshop on Data Warehousing and OLAP
Hi-index | 0.00 |
With rapidly decreasing storage costs, temporal document databases are now a viable solution in many contexts. However, storing an ever-growing database can still be too costly, and as a consequence it is desirable to be able to physically delete old versions of data. Traditionally, this has been performed by an operation called vacuuming, where the oldest versions are physically deleted or migrated from secondary storage to less costly tertiary storage. In temporal document databases on the other hand, it is often more appropriate to remove intermediate versions instead of removing the oldest versions. We call this operation granularity reduction. In this paper we describe the concept of granularity reduction, and present six strategies for selecting the document versions to eliminate. Three of the strategies have been implemented in the V2 temporal document database system, and in this context we discuss the cost of applying the strategies.