Modern scientific discovery is increasingly driven by large-scale supercomputing simulations followed by data analysis tasks. These analyses are performed either offline on smaller-scale clusters or on the supercomputer itself. Unfortunately, both approaches suffer from performance and energy inefficiencies due to increased data movement between the compute and storage subsystems. We therefore propose Active Flash, an in situ scientific data analysis approach in which analysis is conducted on the solid-state device (SSD) where the data already resides. Our performance and energy models show that Active Flash has the potential to address many of these concerns without degrading HPC simulation performance. In addition, we demonstrate an Active Flash prototype built on a commercial SSD controller, which reaffirms the viability of our proposal.