Reimplementing the Cedar file system using logging and group commit
SOSP '87 Proceedings of the eleventh ACM Symposium on Operating systems principles
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Parallel database systems: the future of high performance database systems
Communications of the ACM
The Zebra striped network file system
SOSP '93 Proceedings of the fourteenth ACM symposium on Operating systems principles
IBM Systems Journal
“One size fits all” database architectures do not work for DSS
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
The log-structured merge-tree (LSM-tree)
Acta Informatica
ACM Transactions on Computer Systems (TOCS)
Resource containers: a new facility for resource management in server systems
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Daytona and the fourth-generation language Cymbal
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
ACM Computing Surveys (CSUR)
Space/time trade-offs in hash coding with allowable errors
Communications of the ACM
Recursive functions of symbolic expressions and their computation by machine, Part I
Communications of the ACM
Chord: A scalable peer-to-peer lookup service for internet applications
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
A scalable content-addressable network
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
Implementation techniques for main memory database systems
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Mariposa: A New Architecture for Distributed Data
Proceedings of the Tenth International Conference on Data Engineering
Weaving Relations for Cache Performance
Proceedings of the 27th International Conference on Very Large Data Bases
Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems
Middleware '01 Proceedings of the IFIP/ACM International Conference on Distributed Systems Platforms Heidelberg
Notes on Data Base Operating Systems
Operating Systems, An Advanced Course
Data Compression Using Long Common Strings
DCC '99 Proceedings of the Conference on Data Compression
Tapestry: An Infrastructure for Fault-tolerant Wide-area Location and
Tapestry: An Infrastructure for Fault-tolerant Wide-area Location and
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
C-store: a column-oriented DBMS
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Integrating compression and execution in column-oriented database systems
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Interpreting the data: Parallel analysis with Sawzall
Scientific Programming - Dynamic Grids and Worldwide Computing
Operating system support for planetary-scale network services
NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
Boxwood: abstractions as the foundation for storage infrastructure
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Paxos made live: an engineering perspective
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
The Chubby lock service for loosely-coupled distributed systems
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
Map-reduce-merge: simplified relational data processing on large clusters
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Paxos made live: an engineering perspective
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Sinfonia: a new paradigm for building scalable distributed systems
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
The Chubby lock service for loosely-coupled distributed systems
OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
A relational approach to incrementally extracting and querying structure in unstructured data
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Noncontiguous locking techniques for parallel file systems
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Bigtable: A Distributed Storage System for Structured Data
ACM Transactions on Computer Systems (TOCS)
Service-oriented data denormalization for scalable web applications
Proceedings of the 17th international conference on World Wide Web
Data management projects at Google
ACM SIGMOD Record
The design methodology of Phoenix cluster system software stack
CHINA HPC '07 Proceedings of the 2007 Asian technology information program's (ATIP's) 3rd workshop on High performance computing in China: solution approaches to impediments for high performance computing
Efficient bulk insertion into a distributed ordered table
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
OLTP through the looking glass, and what we found there
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Pig latin: a not-so-foreign language for data processing
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Accelerating large-scale data exploration through data diffusion
DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
DIMM: a distributed metadata management for data-intensive HPC environments
DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
Distributed computing in SOSP and OSDI
ACM SIGACT News
Large-Scale Parallel Collaborative Filtering for the Netflix Prize
AAIM '08 Proceedings of the 4th international conference on Algorithmic Aspects in Information and Management
Read-Optimized, Cache-Conscious, Page Layouts for Temporal Relational Data
DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
A practical scalable distributed B-tree
Proceedings of the VLDB Endowment
SCOPE: easy and efficient parallel processing of massive data sets
Proceedings of the VLDB Endowment
GRIMS: a scalable management and storage system for massive remote sensing images
Proceedings of the 3rd international conference on Scalable information systems
Criteria to Compare Cloud Computing with Current Database Technology
IWSM/Metrikon/Mensura '08 Proceedings of the International Conferences on Software Process and Product Measurement
Teaching large scale data processing: the five-week course and two years' experiences
SCE '08 Proceedings of the 1st ACM Summit on Computing Education in China on First ACM Summit on Computing Education in China
Clouder: a flexible large scale decentralized object store: architecture overview
Proceedings of the Third Workshop on Dependable Distributed Data Management
Sparse indexing: large scale, inline deduplication using sampling and locality
FAST '09 Proccedings of the 7th conference on File and storage technologies
Scalable Transactions for Web Applications in the Cloud
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Sinfonia: A new paradigm for building scalable distributed systems
ACM Transactions on Computer Systems (TOCS)
Modular data storage with Anvil
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
SPIDER: a system for scalable, parallel / distributed evaluation of large-scale RDF data
Proceedings of the 18th ACM conference on Information and knowledge management
Query processing of massive trajectory data based on mapreduce
Proceedings of the first international workshop on Cloud data management
An efficient multi-dimensional index for cloud data management
Proceedings of the first international workshop on Cloud data management
Consistency rationing in the cloud: pay only when it matters
Proceedings of the VLDB Endowment
DisTec: Towards a Distributed System for Telecom Computing
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
LazyBase: freshness vs. performance in information management
ACM SIGOPS Operating Systems Review
HadoopToSQL: a mapReduce query optimizer
Proceedings of the 5th European conference on Computer systems
Cassandra: a decentralized structured storage system
ACM SIGOPS Operating Systems Review
Semi-join computation on distributed file systems using map-reduce-merge model
Proceedings of the 2010 ACM Symposium on Applied Computing
Towards scalable architectures for clickstream data warehousing
DNIS'07 Proceedings of the 5th international conference on Databases in networked information systems
Stateful bulk processing for incremental analytics
Proceedings of the 1st ACM symposium on Cloud computing
The case for PIQL: a performance insightful query language
Proceedings of the 1st ACM symposium on Cloud computing
G-Store: a scalable data store for transactional multi key access in the cloud
Proceedings of the 1st ACM symposium on Cloud computing
Multiscale not multicore: efficient heterogeneous cloud computing
Proceedings of the 2010 ACM-BCS Visions of Computer Science Conference
Extremely large-scale sensing applications for planetary WSNs
Proceedings of the 2nd ACM International Workshop on Hot Topics in Planet-scale Measurement
Keeping bits safe: how hard can it be?
Communications of the ACM
BlueStar: a federation-based approach to building internet-scale data centers
IBM Journal of Research and Development
A new analysis of the false positive rate of a Bloom filter
Information Processing Letters
Parallel processing of data from very large-scale wireless sensor networks
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
ElasTraS: an elastic transactional data store in the cloud
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Fast and cautious evolution of cloud storage
HotStorage'10 Proceedings of the 2nd USENIX conference on Hot topics in storage and file systems
The YouTube video recommendation system
Proceedings of the fourth ACM conference on Recommender systems
Keeping Bits Safe: How Hard Can It Be?
Queue - Storage
ESQP: an efficient SQL query processing for cloud data management
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Benchmarking cloud-based data management systems
CloudDB '10 Proceedings of the second international workshop on Cloud data management
NPC'10 Proceedings of the 2010 IFIP international conference on Network and parallel computing
An access cost-aware approach for object retrieval over multiple sources
Proceedings of the VLDB Endowment
Big data and cloud computing: new wine or just new bottles?
Proceedings of the VLDB Endowment
Knuckles: bringing the database to the data
International Journal of Computational Science and Engineering
What consistency does your key-value store actually provide?
HotDep'10 Proceedings of the Sixth international conference on Hot topics in system dependability
Availability in globally distributed storage systems
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Large-scale incremental processing using distributed transactions and notifications
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
On the expressiveness and trade-offs of large scale tuple stores
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems: Part II
dBug: systematic evaluation of distributed systems
SSV'10 Proceedings of the 5th international conference on Systems software verification
Using Paxos to build a scalable, consistent, and highly available datastore
Proceedings of the VLDB Endowment
DICE: an effective query result cache for distributed storage systems
Journal of Computer Science and Technology
Big data and cloud computing: current state and future opportunities
Proceedings of the 14th International Conference on Extending Database Technology
PRESIDIO: A Framework for Efficient Archival Data Storage
ACM Transactions on Storage (TOS)
FATE and DESTINI: a framework for cloud recovery testing
Proceedings of the 8th USENIX conference on Networked systems design and implementation
Scalable SQL and NoSQL data stores
ACM SIGMOD Record
Aspects of data-intensive cloud computing
From active data management to event-based systems and more
Schema-as-you-go: on probabilistic tagging and querying of wide tables
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Databases and Social Networks
Database scalability, elasticity, and autonomy in the cloud
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Okeanos: wasteless journaling for fast and reliable multistream storage
USENIXATC'11 Proceedings of the 2011 USENIX conference on USENIX annual technical conference
Don't thrash: how to cache your hash on flash
HotStorage'11 Proceedings of the 3rd USENIX conference on Hot topics in storage and file systems
Proceedings of the VLDB Endowment
A correlation-aware data placement strategy for key-value stores
Proceedings of the 11th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
Data integration over NoSQL stores using access path based mappings
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
An efficient quad-tree based index structure for cloud data management
WAIM'11 Proceedings of the 12th international conference on Web-age information management
Detecting failures in distributed systems with the Falcon spy network
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Transactional storage for geo-replicated systems
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
TripleCloud: An Infrastructure for Exploratory Querying over Web-Scale RDF Data
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Incremental recomputations in MapReduce
Proceedings of the third international workshop on Cloud data management
Efficient data distribution strategy for join query processing in the cloud
Proceedings of the third international workshop on Cloud data management
Proceedings of the third international workshop on Cloud data management
Scalable data management in distributed information systems
OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems
PIQL: success-tolerant query processing in the cloud
Proceedings of the VLDB Endowment
Scalable real time data management for smart grid
Proceedings of the Middleware 2011 Industry Track Workshop
Object-relational event middleware for web applications
Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research
CernVM-FS: delivering scientific software to globally distributed computing resources
Proceedings of the first international workshop on Network-aware data management
Microsharding: a declarative approach to support elastic OLTP workloads
ACM SIGOPS Operating Systems Review
Data management challenges in cloud computing infrastructures
DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
Agent based cloud storage system
AIC'10/BEBI'10 Proceedings of the 10th WSEAS international conference on applied informatics and communications, and 3rd WSEAS international conference on Biomedical electronics and biomedical informatics
The evolving landscape of data management in the cloud
International Journal of Computational Science and Engineering
Performance engineering for cloud computing
EPEW'11 Proceedings of the 8th European conference on Computer Performance Engineering
DPillar: Dual-port server interconnection network for large scale data centers
Computer Networks: The International Journal of Computer and Telecommunications Networking
Partiqle: an elastic SQL engine over key-value stores
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Towards principled design support for scalable OLTP workloads
Proceedings of the 15th International Conference on Extending Database Technology
Jingwei+: a distributed large-scale RDF data server
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Projecting disk usage based on historical trends in a cloud environment
Proceedings of the 3rd workshop on Scientific Cloud Computing Date
LogBase: a scalable log-structured database system in the cloud
Proceedings of the VLDB Endowment
TBF: a high-efficient query mechanism in de-duplication backup system
GPC'12 Proceedings of the 7th international conference on Advances in Grid and Pervasive Computing
RAMCube: exploiting network proximity for ram-based key-value store
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Using R for iterative and incremental processing
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Automated diagnosis without predictability is a recipe for failure
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Practical hardening of crash-tolerant systems
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
A MapReduce-supported network structure for data centers
Concurrency and Computation: Practice & Experience
Don't thrash: how to cache your hash on flash
Proceedings of the VLDB Endowment
Solving big data challenges for enterprise application performance management
Proceedings of the VLDB Endowment
MyCassandra: a cloud storage supporting both read heavy and write heavy workloads
Proceedings of the 5th Annual International Systems and Storage Conference
Storage provisioning and allocation in a large cloud environment
Proceedings of the 2012 workshop on Management of big data systems
MySQL to NoSQL: data modeling challenges in supporting scalability
Proceedings of the 3rd annual conference on Systems, programming, and applications: software for humanity
High throughput computing over peer-to-peer networks
Future Generation Computer Systems
Enhancing coordination in cloud infrastructures with an extendable coordination service
Proceedings of the Workshop on Secure and Dependable Middleware for Cloud Monitoring and Management
Communications of the ACM
MapReduce-Based data stream processing over large history data
ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
Secure distributed storage for bulk data
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Large-scale ranking and selection using cloud computing
Proceedings of the Winter Simulation Conference
Improving Bandwidth Efficiency for Consistent Multistream Storage
ACM Transactions on Storage (TOS)
ElasTraS: An elastic, scalable, and self-managing transactional database for the cloud
ACM Transactions on Database Systems (TODS)
Cloud Platform Datastore Support
Journal of Grid Computing
A New Electronic Commerce Architecture in the Cloud
Journal of Electronic Commerce in Organizations
Massive electronic records processing for digital archives in cloud
ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
Evaluating Cassandra as a manager of large file sets
Proceedings of the 3rd International Workshop on Cloud Data and Platforms
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Distributed and Parallel Databases
Future Generation Computer Systems
Fast candidate generation for real-time tweet search with bloom filter chains
ACM Transactions on Information Systems (TOIS)
Cache conscious star-join in MapReduce environments
Proceedings of the 2nd International Workshop on Cloud Intelligence
Archiving the relaxed consistency web
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Simplifying MapReduce data processing
International Journal of Computational Science and Engineering
On The Potential Integration of an Ontology-Based Data Access Approach in NoSQL Stores
International Journal of Distributed Systems and Technologies
Fast multi-fields query processing in bigtable based cloud systems
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Janus: optimal flash provisioning for cloud storage workloads
USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
Managing smartphone testbeds with smartlab
LISA'13 Proceedings of the 27th international conference on Large Installation System Administration
Document vector representations for feature extraction in multi-stage document ranking
Information Retrieval
ComMapReduce: An improvement of MapReduce with lightweight communication mechanisms
Data & Knowledge Engineering
An improved partitioning mechanism for optimizing massive data analysis using MapReduce
The Journal of Supercomputing
HARDFS: hardening HDFS with selective and lightweight versioning
FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
Log-structured memory for DRAM-based storage
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Analysis of HDFS under HBase: a facebook messages case study
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
A novel approach to data deduplication over the engineering-oriented cloud systems
Integrated Computer-Aided Engineering
Hi-index | 0.03 |
Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. Many projects at Google store data in Bigtable, including web indexing, Google Earth, and Google Finance. These applications place very different demands on Bigtable, both in terms of data size (from URLs to web pages to satellite imagery) and latency requirements (from backend bulk processing to real-time data serving). Despite these varied demands, Bigtable has successfully provided a flexible, high-performance solution for all of these Google products. In this paper we describe the simple data model provided by Bigtable, which gives clients dynamic control over data layout and format, and we describe the design and implementation of Bigtable.