Data vaults: a symbiosis between database technology and scientific file repositories

  • Authors:
  • Milena Ivanova; Martin Kersten; Stefan Manegold

  • Affiliations:
  • Centrum Wiskunde & Informatica (CWI), Amsterdam, The Netherlands (all three authors)

  • Venue:
  • SSDBM'12: Proceedings of the 24th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2012

Abstract

In this short paper, we outline the data vault, a database-attached external file repository. It provides a true symbiosis between a DBMS and existing file-based repositories: data is kept in its original format, while scalable processing functionality is provided through the DBMS facilities. In particular, the data vault provides transparent access to all data kept in the repository through an (array-based) query language, using the file-type-specific scientific libraries to read the files. The design space for data vaults is characterized by requirements coming from various fields. We present a reference architecture for their realization in (commercial) DBMSs and a concrete implementation in MonetDB for remote sensing data, geared at content-based image retrieval.
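
The idea of keeping files in their original format and reading them only when a query needs them can be illustrated with a small sketch. This is a toy model under stated assumptions, not the MonetDB implementation or the paper's reference architecture: the `DataVault` class, its method names, and the CSV and JSON readers are all hypothetical stand-ins for a DBMS catalog and for file-type-specific scientific libraries.

```python
import csv
import json
import os
import tempfile


class DataVault:
    """Toy data vault: a catalog of external files plus on-demand loading.

    Hypothetical illustration only; names and structure are assumptions.
    """

    def __init__(self):
        self.readers = {}  # file extension -> function(path) -> list of row dicts
        self.catalog = {}  # table name -> file path (metadata only, no data copied)
        self.cache = {}    # table name -> rows already loaded by a reader
        self.loads = 0     # number of times a file was actually read

    def register_reader(self, extension, reader):
        # A reader stands in for a file-type-specific library.
        self.readers[extension] = reader

    def attach(self, name, path):
        # Attaching records where the file lives; the file itself is not read.
        self.catalog[name] = path

    def scan(self, name):
        # First access triggers the format-specific reader; later ones hit the cache.
        if name not in self.cache:
            path = self.catalog[name]
            extension = os.path.splitext(path)[1]
            self.cache[name] = self.readers[extension](path)
            self.loads += 1
        return self.cache[name]

    def select(self, name, predicate):
        # A query sees rows, regardless of the underlying file format.
        return [row for row in self.scan(name) if predicate(row)]


def read_csv(path):
    with open(path, newline="") as f:
        return [dict(row) for row in csv.DictReader(f)]


def read_json(path):
    with open(path) as f:
        return json.load(f)


def demo():
    workdir = tempfile.mkdtemp()
    csv_path = os.path.join(workdir, "tiles.csv")
    json_path = os.path.join(workdir, "scenes.json")
    with open(csv_path, "w", newline="") as f:
        f.write("tile,cloud_cover\nA,0.1\nB,0.8\n")
    with open(json_path, "w") as f:
        json.dump([{"scene": "S1", "tile": "A"}], f)

    vault = DataVault()
    vault.register_reader(".csv", read_csv)
    vault.register_reader(".json", read_json)
    vault.attach("tiles", csv_path)
    vault.attach("scenes", json_path)
    loads_after_attach = vault.loads

    clear = vault.select("tiles", lambda r: float(r["cloud_cover"]) < 0.5)
    vault.select("tiles", lambda r: True)  # served from the cache, no second read
    return loads_after_attach, clear, vault.loads


if __name__ == "__main__":
    print(demo())
```

In this sketch, attaching a file costs nothing beyond a catalog entry, and the `scenes` file is never read because no query touches it. A real system would additionally have to handle cache eviction, changes to the underlying files, and array-valued data, none of which is modeled here.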