Laissez-faire file sharing: access control designed for individuals at the endpoints
NSPW '09 Proceedings of the 2009 workshop on New security paradigms workshop
Maximizing efficiency by trading storage for computation
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Nectar: automatic management of data and computation in datacenters
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Usage behavior of a large-scale scientific archive
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
In high-performance scientific computing, users output millions of files per project or simulation, resulting in petabytes of information. Little is known about how users make sense of it all, and what the major usability issues are in interacting with a file system at scale. We conducted interviews with scientists at national laboratories to identify common practices and issues with current peta-scale file system usage. The major usability problem encountered in the interviews was the purge threat, triggered when the parallel file system reaches capacity, and warning users about impending data loss. We show that the threat is not communicated to the users of the system in a meaningful way. We present three methods scientists used to address the purge threat--analysis, automation, and subversion--and discuss how subversion of the purging system is a clear indication of its lack of utility and indicative of its cognitive complexity. We define reactionary and cautionary archiving and draw a parallel between archiving methods and data production paradigms. Finally, we propose two non-hierarchical file and directory representation models to address the purge threat.