Scale and performance in a distributed file system
ACM Transactions on Computer Systems (TOCS)
A case for redundant arrays of inexpensive disks (RAID)
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Replication in the harp file system
SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Serverless network file systems
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Frangipani: a scalable distributed file system
Proceedings of the sixteenth ACM symposium on Operating systems principles
A cost-effective, high-bandwidth storage architecture
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Cluster I/O with River: making the fast case common
Proceedings of the sixth workshop on I/O in parallel and distributed systems
GPFS: A Shared-Disk File System for Large Computing Clusters
FAST '02 Proceedings of the Conference on File and Storage Technologies
Cheap recovery: a key to self-managing state
ACM Transactions on Storage (TOS)
Dynamic Metadata Management for Petabyte-Scale File Systems
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Supporting Cluster-Based Network Services on Functionally Symmetric Software Architecture
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
A Self-Organizing Storage Cluster for Parallel Data-Intensive Applications
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
FS: An In-Kernel Integrity Checker and Intrusion Detection File System
LISA '04 Proceedings of the 18th USENIX conference on System administration
"One Size Fits All": An Idea Whose Time Has Come and Gone
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Deep Store: An Archival Storage System Architecture
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
An Efficient Topology-Adaptive Membership Protocol for Large-Scale Cluster-Based Services
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
NetSolve/D: A Massively Parallel Grid Execution System for Scalable Data Intensive Collaboration
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 10 - Volume 11
A model for characterizing the scalability of distributed systems
ACM SIGOPS Operating Systems Review
Enterprise Software as Service
Queue - Enterprise Distributed Computing
Systems Support for Preemptive Disk Scheduling
IEEE Transactions on Computers
Proceedings of the twentieth ACM symposium on Operating systems principles
Ensuring data integrity in storage: techniques and applications
Proceedings of the 2005 ACM workshop on Storage security and survivability
Separating Abstractions from Resources in a Tactical Storage System
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
FreeLoader: Scavenging Desktop Storage Resources for Scientific Data
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Efficient and decentralized computation of approximate global state
ACM SIGCOMM Computer Communication Review
On the Benefits of aWorkflow-Aware File System in High-Performance Computing Systems
HPCASIA '05 Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region
CEFT: A cost-effective, fault-tolerant parallel virtual file system
Journal of Parallel and Distributed Computing
BambooTrust: practical scalable trust management for global public computing
Proceedings of the 2006 ACM symposium on Applied computing
Building a research library for the history of the web
Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Want scalable computing?: speculate!
ACM SIGACT News
Constructing collaborative desktop storage caches for large scientific datasets
ACM Transactions on Storage (TOS)
Tuning file system block addressing for performance
Proceedings of the 44th annual Southeast regional conference
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
CRUSH: controlled, scalable, decentralized placement of replicated data
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Deriving distribution of thread service time in layered queueing networks
WOSP '07 Proceedings of the 6th international workshop on Software and performance
The SMART way to migrate replicated stateful services
Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems 2006
A heterogeneous storage grid enabled by grid service
ACM SIGOPS Operating Systems Review
Recovering transient data: automated on-demand data reconstruction and offloading for supercomputers
ACM SIGOPS Operating Systems Review
Survey of research towards robust peer-to-peer networks: search methods
Computer Networks: The International Journal of Computer and Telecommunications Networking
Interpreting the data: Parallel analysis with Sawzall
Scientific Programming - Dynamic Grids and Worldwide Computing
Exploring high performance distributed file storage using LDPC codes
Parallel Computing
Extending ACID semantics to the file system
ACM Transactions on Storage (TOS)
Detecting near-duplicates for web crawling
Proceedings of the 16th international conference on World Wide Web
Proceedings of the 16th international conference on World Wide Web
BitVault: a highly reliable distributed data retention platform
ACM SIGOPS Operating Systems Review - Systems work at Microsoft Research
Xen and the art of repeated research
ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
Ursa minor: versatile cluster-based storage
FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
STAR: an efficient coding scheme for correcting triple storage node failures
FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
The many faces of systems research: and how to evaluate them
HOTOS'05 Proceedings of the 10th conference on Hot Topics in Operating Systems - Volume 10
Explicit control a batch-aware distributed file system
NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
Chain replication for supporting high throughput and availability
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Boxwood: abstractions as the foundation for storage infrastructure
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Bridging local and wide area networks for overlay distributed file systems
WORLDS'05 Proceedings of the 2nd conference on Real, Large Distributed Systems - Volume 2
Bigtable: a distributed storage system for structured data
OSDI '06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7
Availability of multi-object operations
NSDI'06 Proceedings of the 3rd conference on Networked Systems Design & Implementation - Volume 3
Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you?
FAST '07 Proceedings of the 5th USENIX conference on File and Storage Technologies
Partial content distribution on high performance networks
Proceedings of the 16th international symposium on High performance distributed computing
Direct-pNFS: scalable, transparent, and versatile access to parallel file systems
Proceedings of the 16th international symposium on High performance distributed computing
Dryad: distributed data-parallel programs from sequential building blocks
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Antiquity: exploiting a secure log for wide-area distributed storage
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Proactive fault tolerance for HPC with Xen virtualization
Proceedings of the 21st annual international conference on Supercomputing
A global and parallel file system for grids
Future Generation Computer Systems - Special section: Data mining in grid computing environments
Fast generation of result snippets in web search
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Paxos made live: an engineering perspective
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Optimal inter-object correlation when replicating for availability
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Object replication degree customization for high availability
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Parallel test generation and execution with Korat
Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Understanding disk failure rates: What does an MTTF of 1,000,000 hours mean to you?
ACM Transactions on Storage (TOS)
Google's MapReduce programming model — Revisited
Science of Computer Programming
Low-overhead byzantine fault-tolerant storage
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Sinfonia: a new paradigm for building scalable distributed systems
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Dynamo: amazon's highly available key-value store
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Improving file system reliability with I/O shepherding
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Stasis: flexible transactional storage
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
Bigtable: a distributed storage system for structured data
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
Ceph: a scalable, high-performance distributed file system
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
Distributed directory service in the Farsite file system
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
The Chubby lock service for loosely-coupled distributed systems
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
A Feasibility Study of a Virtual Storage System for Large Organizations
VTDC '06 Proceedings of the 2nd International Workshop on Virtualization Technology in Distributed Computing
Exploiting type-awareness in a self-recovering disk
Proceedings of the 2007 ACM workshop on Storage security and survivability
Towards efficient search on unstructured data: an intelligent-storage approach
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Google's MapReduce programming model – Revisited
Science of Computer Programming
Niobe: A practical replication protocol
ACM Transactions on Storage (TOS)
MapReduce: simplified data processing on large clusters
Communications of the ACM - 50th anniversary issue: 1958 - 2008
Stork: package management for distributed VM environments
LISA'07 Proceedings of the 21st conference on Large Installation System Administration Conference
Cluster computing for web-scale data processing
Proceedings of the 39th SIGCSE technical symposium on Computer science education
Replication degree customization for high availability
Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems 2008
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
NFS-cc: tuning NFS for concurrent read sharing
International Journal of High Performance Computing and Networking
Scalable security for petascale parallel file systems
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Evaluation of active storage strategies for the lustre parallel file system
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Optimizing center performance through coordinated data staging, scheduling and recovery
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Pergamum: replacing tape with energy efficient, reliable, disk-based archival storage
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Scalable performance of the Panasas parallel file system
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Measurement and analysis of TCP throughput collapse in cluster-based storage systems
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
An analysis of data corruption in the storage stack
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Bigtable: A Distributed Storage System for Structured Data
ACM Transactions on Computer Systems (TOCS)
A nine year study of file system and storage benchmarking
ACM Transactions on Storage (TOS)
On application-level approaches to avoiding TCP throughput collapse in cluster-based storage systems
PDSW '07 Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07
Searching and navigating petabyte-scale file systems based on facets
PDSW '07 Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07
RADOS: a scalable, reliable storage service for petabyte-scale storage clusters
PDSW '07 Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07
Data management projects at Google
ACM SIGMOD Record
Shifted declustering: a placement-ideal layout scheme for multi-way replication storage architecture
Proceedings of the 22nd annual international conference on Supercomputing
FaTLease: scalable fault-tolerant lease negotiation with paxos
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
File grouping for scientific data management: lessons from experimenting with real traces
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Accelerating large-scale data exploration through data diffusion
DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
Proceedings of the first international conference on Networks for grid applications
XROOTD/TXNetFile: a highly scalable architecture for data access in the ROOT environment
TELE-INFO'05 Proceedings of the 4th WSEAS International Conference on Telecommunications and Informatics
Zyzzyva: speculative Byzantine fault tolerance
Communications of the ACM - Remembering Jim Gray
High-performance land surface modeling with a Linux cluster
Computers & Geosciences
Data mining using high performance data clouds: experimental studies using sector and sphere
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
A scalable, commodity data center network architecture
Proceedings of the ACM SIGCOMM 2008 conference on Data communication
Dcell: a scalable and fault-tolerant network structure for data centers
Proceedings of the ACM SIGCOMM 2008 conference on Data communication
ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Free factories: unified infrastructure for data intensive web services
ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Towards practical intrusion tolerant systems: a blueprint
Proceedings of the 4th annual workshop on Cyber security and information intelligence research: developing strategies to meet the cyber security and information intelligence challenges ahead
Proactive process-level live migration in HPC environments
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Compute and storage clouds using wide area high performance networks
Future Generation Computer Systems
Meeting the data challenge: curriculum development for parallel data systems
SIGITE '08 Proceedings of the 9th ACM SIGITE conference on Information technology education
ACM Transactions on Storage (TOS)
An analysis of data corruption in the storage stack
ACM Transactions on Storage (TOS)
Large-Scale Parallel Collaborative Filtering for the Netflix Prize
AAIM '08 Proceedings of the 4th international conference on Algorithmic Aspects in Information and Management
User Defined Partitioning - Group Data Based on Computation Model
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Towards realistic file-system benchmarks with CodeMRI
ACM SIGMETRICS Performance Evaluation Review
Fault-tolerant stream processing using a distributed, replicated file system
Proceedings of the VLDB Endowment
SCOPE: easy and efficient parallel processing of massive data sets
Proceedings of the VLDB Endowment
PNUTS: Yahoo!'s hosted data serving platform
Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment
Configurable security for scavenged storage systems
Proceedings of the 4th ACM international workshop on Storage security and survivability
GRIMS: a scalable management and storage system for massive remote sensing images
Proceedings of the 3rd international conference on Scalable information systems
Kinesis: A new approach to replica placement in distributed storage systems
ACM Transactions on Storage (TOS)
Managing Very-Large Distributed Datasets
OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
Criteria to Compare Cloud Computing with Current Database Technology
IWSM/Metrikon/Mensura '08 Proceedings of the International Conferences on Software Process and Product Measurement
Replication in Peer-to-Peer Systems
IWSOS '08 Proceedings of the 3rd International Workshop on Self-Organizing Systems
Umbrella file system: Storage management across heterogeneous devices
ACM Transactions on Storage (TOS)
An implementation of parallel file distribution in an agent hierarchy
The Journal of Supercomputing
Gordon: using flash memory to build fast, power-efficient clusters for data-intensive applications
Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Journal of Parallel and Distributed Computing
Teaching large scale data processing: the five-week course and two years' experiences
SCE '08 Proceedings of the 1st ACM Summit on Computing Education in China on First ACM Summit on Computing Education in China
Clouder: a flexible large scale decentralized object store: architecture overview
Proceedings of the Third Workshop on Dependable Distributed Data Management
A Service-Oriented Architecture to enable virtual storage services: a dynamic collaboration context
International Journal of Ad Hoc and Ubiquitous Computing
HYDRAstor: a Scalable Secondary Storage
FAST '09 Proccedings of the 7th conference on File and storage technologies
Smoke and mirrors: reflecting files at a geographically remote location without loss of performance
FAST '09 Proccedings of the 7th conference on File and storage technologies
Defining weakly consistent Byzantine fault-tolerant services
LADIS '08 Proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware
LADIS '08 Proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware
Reducing the costs of large-scale BFT replication
LADIS '08 Proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware
Factored operating systems (fos): the case for a scalable operating system for multicores
ACM SIGOPS Operating Systems Review
Architecture of the internet archive
SYSTOR '09 Proceedings of SYSTOR 2009: The Israeli Experimental Systems Conference
R-ADMAD: high reliability provision for large-scale de-duplication archival storage systems
Proceedings of the 23rd international conference on Supercomputing
The quest for scalable support of data-intensive workloads in distributed systems
Proceedings of the 18th ACM international symposium on High performance distributed computing
Exploring data reliability tradeoffs in replicated storage systems
Proceedings of the 18th ACM international symposium on High performance distributed computing
A distributed architecture for data mining and integration
Proceedings of the second international workshop on Data-aware distributed computing
Proceedings of the 2009 workshop on Resiliency in high performance
Making cluster applications energy-aware
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
Tashi: location-aware cluster management
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
Toward a cloud computing research agenda
ACM SIGACT News
Open-source grid technologies for web-scale computing
ACM SIGACT News
Pairwise document similarity in large collections with MapReduce
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Flexible, wide-area storage for distributed systems with WheelFS
NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
Zeno: eventually consistent Byzantine-fault tolerance
NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
A comparison of approaches to large-scale data analysis
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Experiences on Processing Spatial Data with MapReduce
SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
What's inside the Cloud? An architectural map of the Cloud landscape
CLOUD '09 Proceedings of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing
Brute force and indexed approaches to pairwise document similarity comparisons with MapReduce
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
GMount: An Ad Hoc and Locality-Aware Distributed File System by Using SSH and FUSE
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
File Clustering Based Replication Algorithm in a Grid Environment
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
PortLand: a scalable fault-tolerant layer 2 data center network fabric
Proceedings of the ACM SIGCOMM 2009 conference on Data communication
BCube: a high performance, server-centric network architecture for modular data centers
Proceedings of the ACM SIGCOMM 2009 conference on Data communication
ROAR: increasing the flexibility and performance of distributed search
Proceedings of the ACM SIGCOMM 2009 conference on Data communication
Safe and effective fine-grained TCP retransmissions for datacenter communication
Proceedings of the ACM SIGCOMM 2009 conference on Data communication
Understanding TCP incast throughput collapse in datacenter networks
Proceedings of the 1st ACM workshop on Research on enterprise networking
Why should we integrate services, servers, and networking in a data center?
Proceedings of the 1st ACM workshop on Research on enterprise networking
Access-pattern and bandwidth aware file replication algorithm in a grid environment
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
GMount: Build your grid file system on the fly
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
Towards Efficient MapReduce Using MPI
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
FaTLease: scalable fault-tolerant lease negotiation with Paxos
Cluster Computing
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A DHT Key-Value Storage System with Carrier Grade Performance
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Journal of Computing Sciences in Colleges
UNIMARC-XML performance testing
Proceedings of the 2008 Euro American Conference on Telematics and Information Systems
Fast, easy, and cheap: construction of statistical machine translation models with MapReduce
StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Exploring large-data issues in the curriculum: a case study with MapReduce
TeachCL '08 Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics
Load balancing and fault-tolerance for scalable network file systems using by web services
ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
Dynamic load balancing for I/O-intensive applications on clusters
ACM Transactions on Storage (TOS)
Sinfonia: A new paradigm for building scalable distributed systems
ACM Transactions on Computer Systems (TOCS)
MapReduce: a flexible data processing tool
Communications of the ACM - Amir Pnueli: Ahead of His Time
FAWN: a fast array of wimpy nodes
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Quincy: fair scheduling for distributed computing clusters
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
New frontiers in internet network management
ACM SIGCOMM Computer Communication Review
Proceedings of the First Asia-Pacific Symposium on Internetware
The nature of data center traffic: measurements & analysis
Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
Practical lessons of data mining at Yahoo!
Proceedings of the 18th ACM conference on Information and knowledge management
Lessons learned from a year's worth of benchmarks of large data clouds
Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers
Modeling and evaluation of serial multicast remote procedure calls (RPCs)
IEEE Communications Letters
Query processing of massive trajectory data based on mapreduce
Proceedings of the first international workshop on Cloud data management
An efficient multi-dimensional index for cloud data management
Proceedings of the first international workshop on Cloud data management
Leveraging a scalable row store to build a distributed text index
Proceedings of the first international workshop on Cloud data management
Adaptive and scalable metadata management to support a trillion files
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
What is analytic infrastructure and why should you care?
ACM SIGKDD Explorations Newsletter
EDFS: a semi-centralized efficient distributed file system
Proceedings of the 10th ACM/IFIP/USENIX International Conference on Middleware
MDCube: a high performance network structure for modular data center interconnection
Proceedings of the 5th international conference on Emerging networking experiments and technologies
NGI'09 Proceedings of the 5th Euro-NGI conference on Next Generation Internet networks
A unified interface for visual and interactive web search
CIIT '07 The Sixth IASTED International Conference on Communications, Internet, and Information Technology
Churn-Resilient Replication Strategy for Peer-to-Peer Distributed Hash-Tables
SSS '09 Proceedings of the 11th International Symposium on Stabilization, Safety, and Security of Distributed Systems
Cloud Computing Boosts Business Intelligence of Telecommunication Industry
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
An Efficient Cloud Computing-Based Architecture for Freight System Application in China Railway
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Evaluating MapReduce on Virtual Machines: The Hadoop Case
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Towards a Theory of Universally Composable Cloud Computing
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Parallel K-Means Clustering Based on MapReduce
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
A Data Distribution Aware Task Scheduling Strategy for MapReduce System
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
BlobSeer: how to enable efficient versioning for large object storage under heavy access concurrency
Proceedings of the 2009 EDBT/ICDT Workshops
What can visual content analysis do for text based image search?
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Design of a hierarchical global scale cluster system
ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 3
Ganesha: blackBox diagnosis of MapReduce systems
ACM SIGMETRICS Performance Evaluation Review
Mixing Hadoop and HPC workloads on parallel filesystems
Proceedings of the 4th Annual Workshop on Petascale Data Storage
DiskReduce: RAID for data-intensive scalable computing
Proceedings of the 4th Annual Workshop on Petascale Data Storage
Uncovering errors: the cost of detecting silent data corruption
Proceedings of the 4th Annual Workshop on Petascale Data Storage
Existence and construction of capacity-achieving network codes for distributed storage
IEEE Journal on Selected Areas in Communications
Optimizing joins in a map-reduce environment
Proceedings of the 13th International Conference on Extending Database Technology
DEDUCE: at the intersection of MapReduce and stream processing
Proceedings of the 13th International Conference on Extending Database Technology
The case for a versatile storage system
ACM SIGOPS Operating Systems Review
ACM SIGOPS Operating Systems Review
On the energy (in)efficiency of Hadoop clusters
ACM SIGOPS Operating Systems Review
Mining dependency in distributed systems through unstructured logs analysis
ACM SIGOPS Operating Systems Review
Boom analytics: exploring data-centric, declarative programming for the cloud
Proceedings of the 5th European conference on Computer systems
Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling
Proceedings of the 5th European conference on Computer systems
PIBUS: a network memory-based peer-to-peer IO buffering service
NETWORKING'07 Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet
Cassandra: a decentralized structured storage system
ACM SIGOPS Operating Systems Review
Harnessing input redundancy in a MapReduce framework
Proceedings of the 2010 ACM Symposium on Applied Computing
Semi-join computation on distributed file systems using map-reduce-merge model
Proceedings of the 2010 ACM Symposium on Applied Computing
Content cloaking: preserving privacy with Google Docs and other web applications
Proceedings of the 2010 ACM Symposium on Applied Computing
Towards scalable architectures for clickstream data warehousing
DNIS'07 Proceedings of the 5th international conference on Databases in networked information systems
Distributed indexing of web scale datasets for the cloud
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud
Towards transparent personal content storage in multi-service access networks
EUC'07 Proceedings of the 2007 international conference on Embedded and ubiquitous computing
A distributed filesystem for spare storage
EUROCAST'07 Proceedings of the 11th international conference on Computer aided systems theory
IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
Supporting extended UNIX remove semantics in the OASIS cluster filesystem
ICCSA'07 Proceedings of the 2007 international conference on Computational science and its applications - Volume Part I
GUESSTIMATE: a programming model for collaborative distributed systems
PLDI '10 Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
FlumeJava: easy, efficient data-parallel pipelines
PLDI '10 Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
Lithium: virtual machine storage for the cloud
Proceedings of the 1st ACM symposium on Cloud computing
Stateful bulk processing for incremental analytics
Proceedings of the 1st ACM symposium on Cloud computing
Comet: batched stream processing for data intensive distributed computing
Proceedings of the 1st ACM symposium on Cloud computing
Making cloud intermediate data fault-tolerant
Proceedings of the 1st ACM symposium on Cloud computing
A self-organized, fault-tolerant and scalable replication scheme for cloud storage
Proceedings of the 1st ACM symposium on Cloud computing
Robust and flexible power-proportional storage
Proceedings of the 1st ACM symposium on Cloud computing
Pregel: a system for large-scale graph processing
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Indexing multi-dimensional data in a cloud system
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Integrating hadoop and parallel DBMs
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
A comparison of join algorithms for log processing in MaPreduce
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Data warehousing and analytics infrastructure at facebook
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Automated control for elastic storage
Proceedings of the 7th international conference on Autonomic computing
MuSE: multimedia search engine
Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web
Optimal recovery of single disk failure in RDP code storage systems
Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Web-scale computer vision using MapReduce for multimedia data mining
Proceedings of the Tenth International Workshop on Multimedia Data Mining
Parallel programming framework for large batch transaction processing on scale-out systems
Proceedings of the 3rd Annual Haifa Experimental Systems Conference
Energy proportional datacenter networks
Proceedings of the 37th annual international symposium on Computer architecture
Design a cloud storage platform for pervasive computing environments
Cluster Computing
Design patterns for efficient graph algorithms in MapReduce
Proceedings of the Eighth Workshop on Mining and Learning with Graphs
Toward a cost-effective cloud storage service
ICACT'10 Proceedings of the 12th international conference on Advanced communication technology
Automated tools for manipulating files in a distributed environment with RSYNC
ICACT'10 Proceedings of the 12th international conference on Advanced communication technology
Efficient partial-duplicate detection based on sequence matching
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Eventually linearizable shared objects
Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of distributed computing
Malstone: towards a benchmark for analytics on large data clouds
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
A data placement strategy in scientific cloud workflows
Future Generation Computer Systems
Service Oriented Approach to High Performance Scientific Computing
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Rigel: A Scalable and Lightweight Replica Selection Service for Replicated Distributed File System
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Dynamic Load-Balanced Multicast for Data-Intensive Applications on Clouds
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
An Analysis of Traces from a Production MapReduce Cluster
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Generic and automatic address configuration for data center networks
Proceedings of the ACM SIGCOMM 2010 conference
Symbiotic routing in future data centers
Proceedings of the ACM SIGCOMM 2010 conference
Energy-aware routing in data center network
Proceedings of the first ACM SIGCOMM workshop on Green networking
MOON: MapReduce On Opportunistic eNvironments
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
MRAP: a novel MapReduce-based framework to support HPC analytics applications with access patterns
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
A GPU accelerated storage system
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
XCo: explicit coordination to prevent network fabric congestion in cloud computing cluster platforms
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Lightning: self-adaptive, energy-conserving, multi-zoned, commodity green cloud storage system
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
An overview of the Open Science Data Cloud
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
ROARS: a scalable repository for data intensive scientific computing
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Twister: a runtime for iterative MapReduce
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Pydoop: a Python MapReduce and HDFS API for Hadoop
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
MR-scope: a real-time tracing tool for MapReduce
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
The byzantine empire in the intercloud
ACM SIGACT News
Accelerating parallel analysis of scientific simulation data via Zazen
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
Panache: a parallel file system cache for global file access
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
HydraFS: a high-throughput file system for the HYDRAstor content-addressable storage system
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
ElasTraS: an elastic transactional data store in the cloud
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
In search of an API for scalable file systems: under the table or above it?
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Cloud analytics: do we really need to reinvent the storage stack?
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Mochi: visual log-analysis based tools for debugging hadoop
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
DryadInc: reusing work in large-scale computations
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Operating systems should provide transactions
HotOS'09 Proceedings of the 12th conference on Hot topics in operating systems
Delivering energy proportionality with non energy-proportional systems: optimizing the ensemble
HotPower'08 Proceedings of the 2008 conference on Power aware computing and systems
Hedera: dynamic flow scheduling for data center networks
NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
Prophecy: using history for high-throughput fault tolerance
NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
Everest: scaling down peak loads through I/O off-loading
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
SQCK: a declarative file system checker
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
ShadowNet: a platform for rapid and safe network evolution
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Object storage on CRAQ: high-throughput chain replication for read-mostly workloads
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
A transparently-scalable metadata service for the Ursa Minor storage system
USENIXATC'10 Proceedings of the 2010 USENIX conference on USENIX annual technical conference
SALSA: analyzing logs as state machines
WASL'08 Proceedings of the First USENIX conference on Analysis of system logs
Independent faults in the cloud
Proceedings of the 4th International Workshop on Large Scale Distributed Systems and Middleware
Data-centric reconfiguration with network-attached disks
Proceedings of the 4th International Workshop on Large Scale Distributed Systems and Middleware
Scripting the cloud with skywriting
HotCloud'10 Proceedings of the 2nd USENIX conference on Hot topics in cloud computing
Peer-to-peer bargaining in container-based datacenters
IPTPS'10 Proceedings of the 9th international conference on Peer-to-peer systems
Towards energy proportional cloud for data processing frameworks
SustainIT'10 Proceedings of the First USENIX conference on Sustainable information technology
Chain replication in theory and in practice
Proceedings of the 9th ACM SIGPLAN workshop on Erlang
On securing untrusted clouds with cryptography
Proceedings of the 9th annual ACM workshop on Privacy in the electronic society
SideCar: building programmable datacenter networks without programmable switches
Hotnets-IX Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks
ESQP: an efficient SQL query processing for cloud data management
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Adaptive query execution for data management in the cloud
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Benchmarking cloud-based data management systems
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Proceedings of the FSE/SDP workshop on Future of software engineering research
Software engineering in an uncertain world
Proceedings of the FSE/SDP workshop on Future of software engineering research
XML structural similarity search using mapreduce
WAIM'10 Proceedings of the 11th international conference on Web-age information management
Managing Variability in the IO Performance of Petascale Storage Systems
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
High throughput data-compression for cloud storage
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Multidimensional arrays for warehousing data on clouds
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Data life time for different placement policies in P2P storage systems
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
RSEDP: an effective hybrid data placement algorithm for large-scale storage systems
The Journal of Supercomputing
PH2: an hadoop-based framework for mining structural properties from the PDB database
SAICSIT '10 Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists
BlobSeer: Next-generation data management for large scale infrastructures
Journal of Parallel and Distributed Computing
Energy management for MapReduce clusters
Proceedings of the VLDB Endowment
Dremel: interactive analysis of web-scale datasets
Proceedings of the VLDB Endowment
Efficient B-tree based indexing for cloud data processing
Proceedings of the VLDB Endowment
Frontiers of Computer Science in China
Knuckles: bringing the database to the data
International Journal of Computational Science and Engineering
Towards automatically checking thousands of failures with micro-specifications
HotDep'10 Proceedings of the Sixth international conference on Hot topics in system dependability
Focus replay debugging effort on the control plane
HotDep'10 Proceedings of the Sixth international conference on Hot topics in system dependability
GreenHDFS: towards an energy-conserving, storage-efficient, hybrid Hadoop compute cluster
HotPower'10 Proceedings of the 2010 international conference on Power aware computing and systems
Finding a needle in Haystack: facebook's photo storage
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Availability in globally distributed storage systems
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Large-scale incremental processing using distributed transactions and notifications
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Chukwa: a system for reliable large-scale log collection
LISA'10 Proceedings of the 24th international conference on Large installation system administration
On the expressiveness and trade-offs of large scale tuple stores
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems: Part II
Dynamic proportional share scheduling in Hadoop
JSSPP'10 Proceedings of the 15th international conference on Job scheduling strategies for parallel processing
dBug: systematic evaluation of distributed systems
SSV'10 Proceedings of the 5th international conference on Systems software verification
Modeling files with context streams
UIC'10 Proceedings of the 7th international conference on Ubiquitous intelligence and computing
Cooperative caching versus proactive replication for location dependent request patterns
Journal of Network and Computer Applications
Using Paxos to build a scalable, consistent, and highly available datastore
Proceedings of the VLDB Endowment
Future Generation Computer Systems
Optimal file-distribution in heterogeneous and asymmetric storage networks
SOFSEM'11 Proceedings of the 37th international conference on Current trends in theory and practice of computer science
CPLDP: an efficient large dataset processing system built on cloud platform
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Map-reduce extensions and recursive queries
Proceedings of the 14th International Conference on Extending Database Technology
Energy proportionality for disk storage using replication
Proceedings of the 14th International Conference on Extending Database Technology
Dremel: interactive analysis of web-scale datasets
Communications of the ACM
Social Services Computing: Concepts, Research Challenges, and Directions
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
Scalable and cost-effective interconnection of data-center servers using dual server ports
IEEE/ACM Transactions on Networking (TON)
Improving throughput for small disk requests with proximal I/O
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
The SCADS director: scaling a distributed storage system under stringent performance requirements
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
Scale and concurrency of GIGA+: file system directories with millions of files
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
AONT-RS: blending security and performance in dispersed storage systems
FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
DepSky: dependable and secure storage in a cloud-of-clouds
Proceedings of the sixth conference on Computer systems
Sierra: practical power-proportionality for data center storage
Proceedings of the sixth conference on Computer systems
Scarlett: coping with skewed content popularity in mapreduce clusters
Proceedings of the sixth conference on Computer systems
Wireless link scheduling for data center networks
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
PRESIDIO: A Framework for Efficient Archival Data Storage
ACM Transactions on Storage (TOS)
Application flow control in YouTube video streams
ACM SIGCOMM Computer Communication Review
ASTERIX: towards a scalable, semistructured data platform for evolving-world models
Distributed and Parallel Databases
TritonSort: a balanced large-scale sorting system
Proceedings of the 8th USENIX conference on Networked systems design and implementation
Diagnosing performance changes by comparing request flows
Proceedings of the 8th USENIX conference on Networked systems design and implementation
CIEL: a universal execution engine for distributed data-flow computing
Proceedings of the 8th USENIX conference on Networked systems design and implementation
Paxos replicated state machines as the basis of a high-performance data store
Proceedings of the 8th USENIX conference on Networked systems design and implementation
FATE and DESTINI: a framework for cloud recovery testing
Proceedings of the 8th USENIX conference on Networked systems design and implementation
Scalability limits of Bag-of-Tasks applications running on hierarchical platforms
Journal of Parallel and Distributed Computing
Towards improved load balancing for data intensive distributed computing
Proceedings of the 2011 ACM Symposium on Applied Computing
A cloud-enabled regional climate model evaluation system
Proceedings of the 2nd International Workshop on Software Engineering for Cloud Computing
ASDF: an automated, online framework for diagnosing performance problems
Architecting dependable systems VII
A hadoop-based packet trace processing tool
TMA'11 Proceedings of the Third international conference on Traffic monitoring and analysis
Distributed and fault-tolerant execution framework for transaction processing
Proceedings of the 4th Annual International Conference on Systems and Storage
Proceedings of the International Conference on Web Intelligence, Mining and Semantics
The impact of applications' I/O strategies on the performance of the Lustre parallel file system
International Journal of High Performance Systems Architecture
Providing scalable database services on the cloud
WISE'10 Proceedings of the 11th international conference on Web information systems engineering
Preference driven server selection in peer-2-peer data sharing systems
Proceedings of the fourth international workshop on Data-intensive distributed computing
Making a case for distributed file systems at Exascale
Proceedings of the third international workshop on Large-scale system and application performance
An application-level solution for the TCP-incast problem in data center networks
Proceedings of the Nineteenth International Workshop on Quality of Service
Full-text indexing for optimizing selection operations in large-scale data analytics
Proceedings of the second international workshop on MapReduce and its applications
The case for being lazy: how to leverage lazy evaluation in MapReduce
Proceedings of the 2nd international workshop on Scientific cloud computing
Enhancement of Xen's scheduler for MapReduce workloads
Proceedings of the 20th international symposium on High performance distributed computing
Towards efficient subgraph search in cloud computing environments
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications
Adapting skyline computation to the MapReduce framework: algorithms and experiments
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications
Optimized data placement for column-oriented data store in the distributed environment
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications
Database scalability, elasticity, and autonomy in the cloud
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Design and implementation of live image file feeding to dome theaters
Future Generation Computer Systems
PigSPARQL: mapping SPARQL to Pig Latin
Proceedings of the International Workshop on Semantic Web Information Management
Gatekeeper: supporting bandwidth guarantees for multi-tenant datacenter networks
WIOV'11 Proceedings of the 3rd conference on I/O virtualization
Online migration for geo-distributed storage systems
USENIXATC'11 Proceedings of the 2011 USENIX conference on USENIX annual technical conference
TidyFS: a simple and small distributed file system
USENIXATC'11 Proceedings of the 2011 USENIX conference on USENIX annual technical conference
In search of I/O-optimal recovery from disk failures
HotStorage'11 Proceedings of the 3rd USENIX conference on Hot topics in storage and file systems
A load-balance based resource-scheduling algorithm under cloud computing environment
ICWL'10 Proceedings of the 2010 international conference on New horizons in web-based learning
Dynamic bandwidth allocation for preventing congestion in data center networks
ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part III
CassMail: a scalable, highly-available, and rapidly-prototyped e-mail service
Proceedings of the 11th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
A Hybrid Approach to Failed Disk Recovery Using RAID-6 Codes: Algorithms and Performance Evaluation
ACM Transactions on Storage (TOS)
On the benefits of transparent compression for cost-effective cloud data storage
Transactions on large-scale data- and knowledge-centered systems III
HTAF: hybrid testing automation framework to leverage local and global computing resources
ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part III
Disco: a computing platform for large-scale data analytics
Proceedings of the 10th ACM SIGPLAN workshop on Erlang
An efficient quad-tree based index structure for cloud data management
WAIM'11 Proceedings of the 12th international conference on Web-age information management
Small cache, big effect: provable load balancing for randomly partitioned cluster services
Proceedings of the 2nd ACM Symposium on Cloud Computing
Architecture-based run-time fault diagnosis
ECSA'11 Proceedings of the 5th European conference on Software architecture
Fast crash recovery in RAMCloud
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
A file is not a file: understanding the I/O behavior of Apple desktop applications
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Windows Azure Storage: a highly available cloud storage service with strong consistency
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Detecting failures in distributed systems with the Falcon spy network
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Transactional storage for geo-replicated systems
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Don't settle for eventual: scalable causal consistency for wide-area storage with COPS
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
An adaptive distribution model for multi-dimensional data in decentralized environments
WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part I
How to tell if your cloud files are vulnerable to drive crashes
Proceedings of the 18th ACM conference on Computer and communications security
Concurrent non-deferred reference counting on the Microgrid: first experiences
IFL'10 Proceedings of the 22nd international conference on Implementation and application of functional languages
Repairing Flocks in Peer-to-Peer Networks
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
SPOTlight on testing: stability, performance and operational testing of LANL HPC clusters
State of the Practice Reports
On the duality of data-intensive file system design: reconciling HDFS and PVFS
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Incremental recomputations in MapReduce
Proceedings of the third international workshop on Cloud data management
Efficient data distribution strategy for join query processing in the cloud
Proceedings of the third international workshop on Cloud data management
Demo: Uno: a sharing infrastructure for smartphone sensors and files
Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems
Towards reliable storage systems
Towards reliable storage systems
QoS-enabled distributed mutual exclusion in public clouds
OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems - Volume Part II
Scalable queries for large datasets using cloud computing: a case study
Proceedings of the 15th Symposium on International Database Engineering & Applications
Chimera: data sharing flexibility, shared nothing simplicity
Proceedings of the 15th Symposium on International Database Engineering & Applications
MARIANE: MApReduce Implementation Adapted for HPC Environments
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Benchmarking MapReduce Implementations for Application Usage Scenarios
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
High-Bandwidth remote parallel i/o with the distributed memory filesystem MEMFS
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Eventual consistency: How soon is eventual? An evaluation of Amazon S3's consistency behavior
Proceedings of the 6th Workshop on Middleware for Service Oriented Computing
Parallel data processing with MapReduce: a survey
ACM SIGMOD Record
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
Wukong: A cloud-oriented file service for mobile Internet devices
Journal of Parallel and Distributed Computing
Proactive process-level live migration and back migration in HPC environments
Journal of Parallel and Distributed Computing
A peer-to-peer architecture for data-intensive cycle sharing
Proceedings of the first international workshop on Network-aware data management
Scientific data services: a high-performance I/O system with array semantics
Proceedings of the first annual workshop on High performance computing meets databases
Case study of scientific data processing on a cloud using hadoop
HPCS'09 Proceedings of the 23rd international conference on High Performance Computing Systems and Applications
Riding the elephant: managing ensembles with hadoop
Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers
GHOST: GPGPU-offloaded high performance storage I/O deduplication for primary storage system
Proceedings of the 2012 International Workshop on Programming Models and Applications for Multicores and Manycores
DVM: towards a datacenter-scale virtual machine
VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
A proxy service for the xrootd data server
SAG'04 Proceedings of the First international conference on Scientific Applications of Grid Computing
Dependable distributed computing using free databases
ISAS'05 Proceedings of the Second international conference on Service Availability
Data management challenges in cloud computing infrastructures
DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
A study on workload imbalance issues in data intensive distributed computing
DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
Jockey: guaranteed job latency in data parallel clusters
Proceedings of the 7th ACM european conference on Computer Systems
Agent based cloud storage system
AIC'10/BEBI'10 Proceedings of the 10th WSEAS international conference on applied informatics and communications, and 3rd WSEAS international conference on Biomedical electronics and biomedical informatics
Synchronization scheme using application-sensitive hint
AIC'10/BEBI'10 Proceedings of the 10th WSEAS international conference on applied informatics and communications, and 3rd WSEAS international conference on Biomedical electronics and biomedical informatics
The datacenter needs an operating system
HotCloud'11 Proceedings of the 3rd USENIX conference on Hot topics in cloud computing
DRO+: a systemic and economical approach to improve availability of massive database systems
WISE'06 Proceedings of the 7th international conference on Web Information Systems
The evolving landscape of data management in the cloud
International Journal of Computational Science and Engineering
DAC: generic and automatic address configuration for data center networks
IEEE/ACM Transactions on Networking (TON)
S-CLONE: Socially-aware data replication for social networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Foundations and Trends® in Machine Learning
Achieving power-efficiency in clusters without distributed file system complexity
ISCA'10 Proceedings of the 2010 international conference on Computer Architecture
Cluster computing, recursion and datalog
Datalog'10 Proceedings of the First international conference on Datalog Reloaded
RDFPath: path query processing on large RDF graphs with mapreduce
ESWC'11 Proceedings of the 8th international conference on The Semantic Web
Busy bee: how to use traffic information for better scheduling of background tasks
ICPE '12 Proceedings of the 3rd ACM/SPEC International Conference on Performance Engineering
Democratizing transactional programming
Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
Scalable load balancing in cluster storage systems
Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
DPillar: Dual-port server interconnection network for large scale data centers
Computer Networks: The International Journal of Computer and Telecommunications Networking
FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
Rethinking erasure codes for cloud file systems: minimizing I/O for recovery and degraded reads
FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
Google's hybrid approach to research
Communications of the ACM
V-SMART-join: a scalable mapreduce framework for all-pair similarity joins of multisets and vectors
Proceedings of the VLDB Endowment
Advanced partitioning techniques for massively distributed computation
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
bLSM: a general purpose log structured merge tree
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Walnut: a unified cloud object store
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Octopus: An Upperware based system for building personal pervasive environments
Journal of Systems and Software
The personal cloud: design, architecture and matchmaking algorithms for resource management
Hot-ICE'12 Proceedings of the 2nd USENIX conference on Hot Topics in Management of Internet, Cloud, and Enterprise Networks and Services
Building access oblivious storage cloud for enterprise
Hot-ICE'12 Proceedings of the 2nd USENIX conference on Hot Topics in Management of Internet, Cloud, and Enterprise Networks and Services
Camdoop: exploiting in-network aggregation for big data applications
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Structured comparative analysis of systems logs to diagnose performance problems
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
The case for elastic operating system services in fos
Proceedings of the 49th Annual Design Automation Conference
A service-oriented taxonomical spectrum, cloudy challenges and opportunities of cloud computing
International Journal of Communication Systems
RelaxDHT: A churn-resilient replication strategy for peer-to-peer distributed hash-tables
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Inside "Big Data management": ogres, onions, or parfaits?
Proceedings of the 15th International Conference on Extending Database Technology
Transitive closure and recursive Datalog implemented on clusters
Proceedings of the 15th International Conference on Extending Database Technology
Efficient SPARQL query processing in mapreduce through data partitioning and indexing
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Evaluating spatial keyword queries under the mapreduce framework
DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications
FREP: Energy proportionality for disk storage using replication
Journal of Parallel and Distributed Computing
Pilot-MapReduce: an extensible and flexible MapReduce implementation for distributed data
Proceedings of third international workshop on MapReduce and its Applications Date
Investigation of data locality and fairness in MapReduce
Proceedings of third international workshop on MapReduce and its Applications Date
Projecting disk usage based on historical trends in a cloud environment
Proceedings of the 3rd workshop on Scientific Cloud Computing Date
Enabling event tracing at leadership-class scale through I/O forwarding middleware
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Future Generation Computer Systems
Adapting scientific computing problems to clouds using MapReduce
Future Generation Computer Systems
A MapReduce-based distributed SVM algorithm for automatic image annotation
Computers & Mathematics with Applications
A Workflow-Aware Storage System: An Opportunity Study
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Investigation of Data Locality in MapReduce
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Maestro: Replica-Aware Map Scheduling for MapReduce
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
A Cost-Effective Mechanism for Cloud Data Reliability Management Based on Proactive Replica Checking
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Integrated in-system storage architecture for high performance computing
Proceedings of the 2nd International Workshop on Runtime and Operating Systems for Supercomputers
A File Is Not a File: Understanding the I/O Behavior of Apple Desktop Applications
ACM Transactions on Computer Systems (TOCS)
Big data platforms: What's next?
XRDS: Crossroads, The ACM Magazine for Students - Big Data
Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
Early accurate results for advanced analytics on MapReduce
Proceedings of the VLDB Endowment
MapReduce indexing strategies: Studying scalability and efficiency
Information Processing and Management: an International Journal
Explicit coordination to prevent congestion in data center networks
Cluster Computing
Reliable MapReduce computing on opportunistic resources
Cluster Computing
Reference deployment models for eliminating user concerns on cloud security
The Journal of Supercomputing
Implementation of a distributed data storage system with resource monitoring on cloud computing
GPC'12 Proceedings of the 7th international conference on Advances in Grid and Pervasive Computing
RAMCube: exploiting network proximity for ram-based key-value store
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Opening up black box networks with CloudTalk
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Towards fair sharing of block storage in a multi-tenant cloud
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Automated diagnosis without predictability is a recipe for failure
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
HotStorage'12 Proceedings of the 4th USENIX conference on Hot Topics in Storage and File Systems
MixApart: decoupled analytics for shared storage systems
HotStorage'12 Proceedings of the 4th USENIX conference on Hot Topics in Storage and File Systems
Gnothi: separating data and metadata for efficient and available storage replication
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Dynamic reconfiguration of primary/backup clusters
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Surviving congestion in geo-distributed storage systems
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Practical hardening of crash-tolerant systems
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Systematic approach of using power save mode for cloud data processing services
International Journal of Ad Hoc and Ubiquitous Computing
A MapReduce-supported network structure for data centers
Concurrency and Computation: Practice & Experience
Towards a hybrid row-column database for a cloud-based medical data management system
Proceedings of the 1st International Workshop on Cloud Intelligence
Rya: a scalable RDF triple store for the clouds
Proceedings of the 1st International Workshop on Cloud Intelligence
Redundantly grouped cross-object coding for repairable storage
Proceedings of the Asia-Pacific Workshop on Systems
Processing a trillion cells per mouse click
Proceedings of the VLDB Endowment
Serializability, not serial: concurrency control and availability in multi-datacenter datastores
Proceedings of the VLDB Endowment
Proceedings of the WICSA/ECSA 2012 Companion Volume
Social networking with frientegrity: privacy and integrity with an untrusted provider
Security'12 Proceedings of the 21st USENIX conference on Security symposium
Parallel decision tree with application to water quality data analysis
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
Scalability of replicated metadata services in distributed file systems
DAIS'12 Proceedings of the 12th IFIP WG 6.1 international conference on Distributed Applications and Interoperable Systems
Solving big data challenges for enterprise application performance management
Proceedings of the VLDB Endowment
M3R: increased performance for in-memory Hadoop jobs
Proceedings of the VLDB Endowment
Efficient big data processing in Hadoop MapReduce
Proceedings of the VLDB Endowment
Parallel implementation of ant-based clustering algorithm based on hadoop
ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part I
ESM: efficient and scalable data center multicast routing
IEEE/ACM Transactions on Networking (TON)
Proceedings of the 2012 workshop on Management of big data systems
Storage provisioning and allocation in a large cloud environment
Proceedings of the 2012 workshop on Management of big data systems
Federated cloud-based big data platform in telecommunications
Proceedings of the 2012 workshop on Cloud services, federation, and the 8th open cirrus summit
HFAA: a generic socket API for Hadoop file systems
Proceedings of the 2nd Workshop on Architectures and Systems for Big Data
Resource utilization prediction: a proposal for information technology research
Proceedings of the 1st Annual conference on Research in information technology
Stitch: A language for architecture-based self-adaptation
Journal of Systems and Software
Balanced partition scheme for distributed caching systems to solve load imbalance problems
ACM SIGSOFT Software Engineering Notes
The Journal of Supercomputing
ROARS: a robust object archival system for data intensive scientific computing
Distributed and Parallel Databases
Robust Redundancy Scheme for the Repair Process: Hierarchical Codes in the Bandwidth-Limited Systems
Journal of Grid Computing
SCOPE: parallel databases meet MapReduce
The VLDB Journal — The International Journal on Very Large Data Bases
Redundantly grouped cross-object coding for repairable storage
APSys'12 Proceedings of the Third ACM SIGOPS Asia-Pacific conference on Systems
OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
Spanner: Google's globally-distributed database
OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
T: a data-centric cooling energy costs reduction approach for big data analytics cloud
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Differentially private top-k query over MapReduce
Proceedings of the fourth international workshop on Cloud data management
Coflow: a networking abstraction for cluster applications
Proceedings of the 11th ACM Workshop on Hot Topics in Networks
Improving large graph processing on partitioned graphs in the cloud
Proceedings of the Third ACM Symposium on Cloud Computing
Sailfish: a framework for large scale data processing
Proceedings of the Third ACM Symposium on Cloud Computing
A multi-layer collaborative cache for question answering
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
xOMB: extensible open middleboxes with commodity servers
Proceedings of the eighth ACM/IEEE symposium on Architectures for networking and communications systems
Communications of the ACM
You can stop early with COLA: online processing of aggregate queries in the cloud
Proceedings of the 21st ACM international conference on Information and knowledge management
CloST: a hadoop-based storage system for big spatio-temporal data analytics
Proceedings of the 21st ACM international conference on Information and knowledge management
The xDotGrid native, cross-platform, high-performance xDFS file transfer framework
Computers and Electrical Engineering
A cloud architecture with an efficient scheduling technique
ICICA'12 Proceedings of the Third international conference on Information Computing and Applications
An open-source toolkit for mining Wikipedia
Artificial Intelligence
A Distributed Cache for Hadoop Distributed File System in Real-Time Cloud Services
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Assessing MapReduce for Internet Computing: A Comparison of Hadoop and BitDew-MapReduce
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Droplet: A Distributed Solution of Data Deduplication
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Reducing Storage Overhead with Small Write Bottleneck Avoiding in Cloud RAID System
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Tuning ECN for data center networks
Proceedings of the 8th international conference on Emerging networking experiments and technologies
Datacast: a scalable and efficient reliable group data delivery service for data centers
Proceedings of the 8th international conference on Emerging networking experiments and technologies
Democratizing transactional programming
Proceedings of the 12th International Middleware Conference
Scalable load balancing in cluster storage systems
Proceedings of the 12th International Middleware Conference
Do More Replicas of Object Data Improve the Performance of Cloud Data Centers?
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Cogset: a high performance MapReduce engine
Concurrency and Computation: Practice & Experience
Computing scientometrics in large-scale academic search engines with mapreduce
WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
Toward scalable internet traffic measurement and analysis with Hadoop
ACM SIGCOMM Computer Communication Review
TritonSort: A Balanced and Energy-Efficient Large-Scale Sorting System
ACM Transactions on Computer Systems (TOCS)
A RAMCloud Storage System based on HDFS: Architecture, implementation and evaluation
Journal of Systems and Software
Large-scale ranking and selection using cloud computing
Proceedings of the Winter Simulation Conference
Bridging the gap between applications and networks in data centers
ACM SIGOPS Operating Systems Review
Optimizing budget constrained spend in search advertising
Proceedings of the sixth ACM international conference on Web search and data mining
Ursa: Scalable Load and Power Management in Cloud Storage Systems
ACM Transactions on Storage (TOS)
ACM Transactions on Storage (TOS)
Future Generation Computer Systems
A resource scheduling approach for media uploading in video data center
PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Network-Based inference algorithm on hadoop
ISMIS'12 Proceedings of the 20th international conference on Foundations of Intelligent Systems
Analysis for REPERA: A Hybrid Data Protection Mechanism in Distributed Environment
International Journal of Cloud Applications and Computing
Cloud Computing: Locally Sub-Clouds instead of Globally One Cloud
International Journal of Cloud Applications and Computing
Makeflow: a portable abstraction for data intensive computing on clusters, clouds, and grids
Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
A New Electronic Commerce Architecture in the Cloud
Journal of Electronic Commerce in Organizations
Caju: a content distribution system for edge networks
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
ESOP'13 Proceedings of the 22nd European conference on Programming Languages and Systems
Paragon: QoS-aware scheduling for heterogeneous datacenters
Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems
A document-based data warehousing approach for large scale data mining
ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
Massive electronic records processing for digital archives in cloud
ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
Exploiting and Evaluating MapReduce for Large-Scale Graph Mining
ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Two-level Hash/Table approach for metadata management in distributed file systems
The Journal of Supercomputing
Proceedings of the 5th ACM COMPUTE Conference: Intelligent & scalable system technologies
Towards a unified taxonomy and architecture of cloud frameworks
Future Generation Computer Systems
Evaluating Cassandra as a manager of large file sets
Proceedings of the 3rd International Workshop on Cloud Data and Platforms
Indexing and searching 100M images with map-reduce
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Position paper: elastic processing and storage at the edge of the cloud
Proceedings of the 2013 international workshop on Hot topics in cloud services
kMemvisor: flexible system wide memory mirroring in virtual environments
Proceedings of the 22nd international symposium on High-performance parallel and distributed computing
International Journal of Information and Communication Technology
Configurable performance analysis and evaluation framework for cloud systems
International Journal of Information and Communication Technology
MRSG - A MapReduce simulator over SimGrid
Parallel Computing
Photon: fault-tolerant and scalable joining of continuous data streams
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Performance evaluation of a MongoDB and hadoop platform for scientific data analysis
Proceedings of the 4th ACM workshop on Scientific cloud computing
A throughput optimal algorithm for map task scheduling in mapreduce with data locality
ACM SIGMETRICS Performance Evaluation Review
In-network redundancy generation for opportunistic speedup of data backup
Future Generation Computer Systems
A classification of file placement and replication methods on grids
Future Generation Computer Systems
GCplace: geo-cloud based correlation aware data replica placement
Proceedings of the 28th Annual ACM Symposium on Applied Computing
International Journal of Security and Networks
Scalable RDF graph querying using cloud computing
Journal of Web Engineering
Zone-based data striping for cloud storage
IBM Journal of Research and Development
GPFS-SNC: an enterprise storage framework for virtual-machine clouds
IBM Journal of Research and Development
Stronger semantics for low-latency geo-replicated storage
nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Robustness in the Salus scalable block store
nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
DeepSea: self-adaptive data partitioning and replication in scalable distributed data systems
Proceedings of the 2013 Sigmod/PODS Ph.D. symposium on PhD symposium
DCSP-MC: dependable cloud-based storage platform for mobile computing
International Journal of Networking and Virtual Organisations
On benchmarking online social media analytical queries
First International Workshop on Graph Data Management Experiences and Systems
ISRN Communications and Networking
Octopus: efficient data intensive computing on virtualized datacenters
Proceedings of the 6th International Systems and Storage Conference
QuickSAN: a storage area network for fast, distributed, solid state disks
Proceedings of the 40th Annual International Symposium on Computer Architecture
Leveraging endpoint flexibility in data-intensive clusters
Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
MapReduce with communication overlap (MaRCO)
Journal of Parallel and Distributed Computing
A high performance peer to cloud and peer model augmented with hierarchical secure communications
Journal of Systems and Software
Diagnosing architectural run-time failures
Proceedings of the 8th International Symposium on Software Engineering for Adaptive and Self-Managing Systems
Future Generation Computer Systems
Performance comparison under failures of MPI and MapReduce: An analytical approach
Future Generation Computer Systems
When cycles are cheap, some tables can be huge
HotOS'13 Proceedings of the 14th USENIX conference on Hot Topics in Operating Systems
The case for tiny tasks in compute clusters
HotOS'13 Proceedings of the 14th USENIX conference on Hot Topics in Operating Systems
Spanner: Google’s Globally Distributed Database
ACM Transactions on Computer Systems (TOCS)
Exploiting Redundancies and Deferred Writes to Conserve Energy in Erasure-Coded Storage Clusters
ACM Transactions on Storage (TOS)
DATE '12 Proceedings of the Conference on Design, Automation and Test in Europe
A case for MapReduce over the internet
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
Storage-class memory needs flexible interfaces
Proceedings of the 4th Asia-Pacific Workshop on Systems
Distributed data management using MapReduce
ACM Computing Surveys (CSUR)
FSaaS: Configuring Policies for Managing Shared Files Among Cooperating, Distributed Applications
International Journal of Web Portals
ACIC: automatic cloud I/O configurator for HPC applications
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Algorithms for high-throughput disk-to-disk sorting
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
HAT: history-based auto-tuning MapReduce in heterogeneous environments
The Journal of Supercomputing
"All roads lead to Rome": optimistic recovery for distributed iterative data processing
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Regenerating codes: a system perspective
ACM SIGOPS Operating Systems Review
Data-Intensive Cloud Computing: Requirements, Expectations, Challenges, and Solutions
Journal of Grid Computing
Dynamic Synchronous/Asynchronous Replication
ACM Transactions on Storage (TOS)
Simplifying MapReduce data processing
International Journal of Computational Science and Engineering
International Journal of Grid and High Performance Computing
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
ACM SIGOPS 24th Symposium on Operating Systems Principles
Authenticated storage using small trusted hardware
Proceedings of the 2013 ACM workshop on Cloud computing security workshop
Transaction chains: achieving serializability with low latency in geo-distributed storage systems
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
Discretized streams: fault-tolerant streaming computation at scale
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
The family of mapreduce and large-scale data processing systems
ACM Computing Surveys (CSUR)
A data-centric heuristic for Hadoop provisioning in the cloud
Proceedings of the 6th ACM India Computing Convention
Leveraging sharding in the design of scalable replication protocols
Proceedings of the 4th annual Symposium on Cloud Computing
COLO: COarse-grained LOck-stepping virtual machines for non-stop service
Proceedings of the 4th annual Symposium on Cloud Computing
Cloud-aware processing of MapReduce-based OLAP applications
AusPDC '13 Proceedings of the Eleventh Australasian Symposium on Parallel and Distributed Computing - Volume 140
Utility-Driven share scheduling algorithm in hadoop
ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
Representing mapreduce optimisations in the nested relational calculus
BNCOD'13 Proceedings of the 29th British National conference on Big Data
USTO.RE: a private cloud storage software system
ICWE'13 Proceedings of the 13th international conference on Web Engineering
Greening data center networks with throughput-guaranteed power-aware routing
Computer Networks: The International Journal of Computer and Telecommunications Networking
Content-based chunk placement scheme for decentralized deduplication on distributed file systems
ICCSA'13 Proceedings of the 13th international conference on Computational Science and Its Applications - Volume 1
Generating request streams on Big Data using clustered renewal processes
Performance Evaluation
HotStorage'13 Proceedings of the 5th USENIX conference on Hot Topics in Storage and File Systems
P2EST: parallelization philosophies for evaluating spatio-temporal queries
Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
Explicit multipath congestion control for data center networks
Proceedings of the ninth ACM conference on Emerging networking experiments and technologies
Bullet trains: a study of NIC burst behavior at microsecond timescales
Proceedings of the ninth ACM conference on Emerging networking experiments and technologies
Copysets: reducing the frequency of data loss in cloud storage
USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
Janus: optimal flash provisioning for cloud storage workloads
USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
TABLEFS: enhancing metadata efficiency in the local file system
USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
Trevi: watering down storage hotspots with cool fountain codes
Proceedings of the Twelfth ACM Workshop on Hot Topics in Networks
DepSky: Dependable and Secure Storage in a Cloud-of-Clouds
ACM Transactions on Storage (TOS)
Securing data services: a security architecture design for private storage cloud based on HDFS
International Journal of Grid and Utility Computing
Piranha: optimizing short jobs in Hadoop
Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment
Medical data management in the SYSEO project
ACM SIGMOD Record
High volumes of event stream indexing and efficient multi-keyword searching for cloud monitoring
Future Generation Computer Systems
Towards greener data centers with storage class memory
Future Generation Computer Systems
Structuring PLFS for extensibility
PDSW '13 Proceedings of the 8th Parallel Data Storage Workshop
A big data based data storage systems for rock burst experiment
International Journal of Wireless and Mobile Computing
SDF: software-defined flash for web-scale internet storage systems
Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
A MapReduce-based distributed SVM ensemble for scalable image classification and annotation
Computers & Mathematics with Applications
Scalable multi-access flash store for big data analytics
Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays
MapReduce "garbage" collection
CASCON '13 Proceedings of the 2013 Conference of the Center for Advanced Studies on Collaborative Research
Distributed socialite: a datalog-based language for large-scale graph analysis
Proceedings of the VLDB Endowment
A Novel Cost-Effective Interconnection Networks of Modular Datacenters for the Cloud Computing
Proceedings of the Second International Conference on Innovative Computing and Cloud Computing
A Study of Linux File System Evolution
ACM Transactions on Storage (TOS)
Sector-Disk (SD) Erasure Codes for Mixed Failure Modes in RAID Systems
ACM Transactions on Storage (TOS)
IKAROS: An HTTP-Based Distributed File System, for Low Consumption & Low Specification Devices
Journal of Grid Computing
ComMapReduce: An improvement of MapReduce with lightweight communication mechanisms
Data & Knowledge Engineering
Sporadic decentralized resource maintenance for P2P distributed storage networks
Journal of Parallel and Distributed Computing
MORM: A Multi-objective Optimized Replication Management strategy for cloud storage cluster
Journal of Systems Architecture: the EUROMICRO Journal
An improved partitioning mechanism for optimizing massive data analysis using MapReduce
The Journal of Supercomputing
The Journal of Supercomputing
A multi-dimensional index structure based on improved VA-file and CAN in the cloud
International Journal of Automation and Computing
Beyond IaaS and PaaS: An Extended Cloud Taxonomy for Computation, Storage and Networking
UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing
A study of Linux file system evolution
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
Warming up storage-level caches with bonfire
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
SD codes: erasure codes designed for how storage systems really fail
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
HARDFS: hardening HDFS with selective and lightweight versioning
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
MixApart: decoupled analytics for shared storage systems
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
Shroud: ensuring private access to large-scale data in the data center
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
Log-structured memory for DRAM-based storage
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Analysis of HDFS under HBase: a facebook messages case study
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
SpringFS: bridging agility and performance in elastic distributed storage
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
A novel approach to data deduplication over the engineering-oriented cloud systems
Integrated Computer-Aided Engineering
Optimizing I/O forwarding techniques for extreme-scale event tracing
Cluster Computing
GPFS-SNC: an enterprise cluster file system for big data
IBM Journal of Research and Development
Exalt: empowering researchers to evaluate large-scale storage systems
NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation
Blizzard: fast, cloud-scale block storage for cloud-oblivious applications
NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation
Hi-index | 0.05 |