C-store: a column-oriented DBMS
VLDB '05 Proceedings of the 31st international conference on Very large data bases
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Breaking the memory wall in MonetDB
Communications of the ACM - Surviving the data deluge
A demonstration of SciDB: a science-oriented DBMS
Proceedings of the VLDB Endowment
SciQL, a query language for science applications
Proceedings of the EDBT/ICDT 2011 Workshop on Array Databases
Hybrid merge/overlap execution technique for parallel array processing
Proceedings of the EDBT/ICDT 2011 Workshop on Array Databases
A cloud-enabled regional climate model evaluation system
Proceedings of the 2nd International Workshop on Software Engineering for Cloud Computing
ArrayStore: a storage manager for complex parallel array processing
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Database-as-a-service for long-tail science
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Performance analysis of a dual-tree algorithm for computing spatial distance histograms
The VLDB Journal — The International Journal on Very Large Data Bases
ISABELA-QA: query-driven analytics with ISABELA-compressed extreme-scale scientific data
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
SciHadoop: array-based query processing in Hadoop
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
SciQL: bridging the gap between science and relational DBMS
Proceedings of the 15th Symposium on International Database Engineering & Applications
Towards scalable array-oriented active storage: the pyramid approach
ACM SIGOPS Operating Systems Review
Optimizing I/O for big array analytics
Proceedings of the VLDB Endowment
Data-intensive spatial filtering in large numerical simulation datasets
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Expressive Query Support for Multidimensional Data in Distributed Hash Tables
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Future Generation Computer Systems
EarthDB: scalable analysis of MODIS data using SciDB
Proceedings of the 1st ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
SciQL: array data processing inside an RDBMS
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Cumulon: optimizing statistical data analysis in the cloud
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Real-time collaborative analysis with (almost) pure SQL: a case study in biogeochemical oceanography
Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Run-time creation of the turbulent channel flow database by an HPC simulation using MPI-DB
Proceedings of the 20th European MPI Users' Group Meeting
Autonomously improving query evaluations over multidimensional data in distributed hash tables
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
SIDR: structure-aware intelligent data routing in Hadoop
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
pEDM: online-forecasting for smart energy analytics
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Can we analyze big data inside a DBMS?
Proceedings of the sixteenth international workshop on Data warehousing and OLAP
On-demand unstructured mesh translation for reducing memory pressure during in situ analysis
UltraVis '13 Proceedings of the 8th International Workshop on Ultrascale Visualization
A demonstration of iterative parallel array processing in support of telescope image analysis
Proceedings of the VLDB Endowment
SDS: a framework for scientific data services
PDSW '13 Proceedings of the 8th Parallel Data Storage Workshop
GeoMix: scalable geoscientific array data management
Proceedings of the Industrial Track of the 13th ACM/IFIP/USENIX International Middleware Conference
Trends and outlook for the massive-scale analytics stack
IBM Journal of Research and Development
Hi-index | 0.00 |
SciDB [4, 3] is a new open-source data management system intended primarily for use in application domains that involve very large (petabyte) scale array data; for example, scientific applications such as astronomy, remote sensing and climate modeling, bio-science information management, risk management systems in financial applications, and the analysis of web log data. In this talk we will describe our set of motivating examples and use them to explain the features of SciDB. We then briefly give an overview of the project 'in flight', explaining our novel storage manager, array data model, query language, and extensibility frameworks.