Managing scientific data warehouses requires constant adaptation to changes in processing algorithms, computing environments, database schemas, and usage patterns. We faced this challenge in the RHESSI Experimental Data Center (HEDC), a data center for NASA's RHESSI spacecraft. In this paper we describe our experience in developing HEDC and discuss in detail the design choices we made. To accommodate the adaptations typically encountered in scientific data management systems, HEDC (i) clearly separates generic from domain-specific code in all tiers, (ii) stores the actual data in a file system and uses a DBMS to manage the corresponding metadata, and (iii) revolves around a middle tier designed to scale when more browsing or processing power is required. These design choices are valuable contributions because they address concerns common to a wide range of scientific data management systems.
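Design choice (ii) can be illustrated with a minimal sketch. It is not HEDC's implementation: the table name `data_files`, its columns, the `instrument` attribute, and the helper functions are all hypothetical, and SQLite stands in for whatever DBMS a real system would use. The only point is the division of labor: payloads live as ordinary files, and the database holds just enough metadata to find them.

```python
import hashlib
import sqlite3
import tempfile
from pathlib import Path


def open_catalog(db_path):
    """Create (or open) the metadata catalog. The schema is hypothetical."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS data_files ("
        " id INTEGER PRIMARY KEY,"
        " path TEXT NOT NULL UNIQUE,"
        " size_bytes INTEGER NOT NULL,"
        " sha256 TEXT NOT NULL,"
        " instrument TEXT NOT NULL)"
    )
    return conn


def store(conn, data_dir, name, payload, instrument):
    """Write the payload to the file system; record only metadata in the DBMS."""
    target = Path(data_dir) / name
    target.write_bytes(payload)
    conn.execute(
        "INSERT INTO data_files (path, size_bytes, sha256, instrument)"
        " VALUES (?, ?, ?, ?)",
        (str(target), len(payload), hashlib.sha256(payload).hexdigest(), instrument),
    )
    conn.commit()
    return target


def find_paths(conn, instrument):
    """Query the catalog; callers then read the files directly from disk."""
    rows = conn.execute(
        "SELECT path FROM data_files WHERE instrument = ? ORDER BY id",
        (instrument,),
    )
    return [Path(row[0]) for row in rows]


if __name__ == "__main__":
    data_dir = tempfile.mkdtemp()          # stand-in for the data file system
    catalog = open_catalog(":memory:")     # stand-in for the metadata DBMS
    store(catalog, data_dir, "obs_0001.bin", b"\x00\x01\x02", "RHESSI")
    for path in find_paths(catalog, "RHESSI"):
        print(path.name, path.stat().st_size)
    catalog.close()
```

Because the catalog stores only paths and descriptive attributes, the metadata schema can evolve without rewriting the bulk data, which is the kind of adaptation the abstract says the design is meant to absorb.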