Compilers: principles, techniques, and tools
Compilers: principles, techniques, and tools
Transaction management in the R* distributed database management system
ACM Transactions on Database Systems (TODS)
A measure of transaction processing power
Datamation
ACM Transactions on Database Systems (TODS)
OOPLSA '86 Conference proceedings on Object-oriented programming systems, languages and applications
Multi-disk management algorithms
SIGMETRICS '87 Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems
A workload characterization pipeline for models of parallel systems
SIGMETRICS '87 Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems
801 storage: architecture and programming
ACM Transactions on Computer Systems (TOCS)
Types and persistence in database programming languages
ACM Computing Surveys (CSUR)
Process and dataflow control in distributed data-intensive systems
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
A benchmark of NonStop SQL on the debit credit transaction
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Comparison of dataflow control techniques in distributed data-intensive systems
SIGMETRICS '88 Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Schema design and mapping strategies for persistent object models
Information and Software Technology
DPDS '88 Proceedings of the first international symposium on Databases in parallel and distributed systems
Parallelizing a database programming language
DPDS '88 Proceedings of the first international symposium on Databases in parallel and distributed systems
A comparison of high-availability media recovery techniques
SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
EDBT '90 Proceedings of the 2nd international conference on extending database technology: Advances in Database Technology
VLDB '89 Proceedings of the 15th international conference on Very large data bases
Compiling FAD, a database programming language
Proceedings of the second international workshop on Database programming languages
Using CSIM to model complex systems
WSC '88 Proceedings of the 20th conference on Winter simulation
Locking Performance in a Shared Nothing Parallel Database Machine
IEEE Transactions on Knowledge and Data Engineering
A Single-User Performance Evaluation of the Teradata Database Machine
Proceedings of the 2nd International Workshop on High Performance Transaction Systems
A Parallel Container Model for Data Intensive Applications
IWDM '89 Proceedings of the Sixth International Workshop on Database Machines
An Experiment on Response Time Scalability in Bubba
IWDM '89 Proceedings of the Sixth International Workshop on Database Machines
GAMMA - A High Performance Dataflow Database Machine
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
LDL: A Logic-Based Data Language
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
FAD, a Powerful and Simple Database Language
VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
Virtual memory transaction management
ACM SIGOPS Operating Systems Review
Dataflow query processing using multiprocessor hash-partitioned algorithms (database, pipeline, parallelism)
Problems in supporting data base transactions in an operating system transaction manager
ACM SIGOPS Operating Systems Review
Dynamic file allocation in disk arrays
SIGMOD '91 Proceedings of the 1991 ACM SIGMOD international conference on Management of data
OOPSLA '91 Conference proceedings on Object-oriented programming systems, languages, and applications
Parallel database systems: the future of database processing or a passing fad?
ACM SIGMOD Record - Directions for future database research & development
Parallel database systems: the future of high performance database systems
Communications of the ACM
A performance analysis of alternative multi-attribute declustering strategies
SIGMOD '92 Proceedings of the 1992 ACM SIGMOD international conference on Management of data
Exploiting database parallelism in a message-passing multiprocessor
IBM Journal of Research and Development
ACM Transactions on Database Systems (TODS)
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
On optimal processor allocation to support pipelined hash joins
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Dynamic Load Balancing in Very Large Shared-Nothing Hypercube Database Computers
IEEE Transactions on Computers
A Parallel Hash Join Algorithm for Managing Data Skew
IEEE Transactions on Parallel and Distributed Systems
Parallelism in relational database management systems
IBM Systems Journal
A Performance Study of Robust Distributed Load Sharing Strategies
IEEE Transactions on Parallel and Distributed Systems
Minipage locking support for object-oriented page-server DBMS
CIKM '94 Proceedings of the third international conference on Information and knowledge management
On parallel execution of multiple pipelined hash joins
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
IBM Systems Journal
Parallelism and its price: a case study of nonstop SQL/MP
ACM SIGMOD Record
Parallel evaluation of multi-join queries
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Open issues in parallel query optimization
ACM SIGMOD Record
High-performance sorting on networks of workstations
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Parallel Execution of Hash Joins in Parallel Databases
IEEE Transactions on Parallel and Distributed Systems
Cluster architectures and S/390 Parallel Sysplex scalability
IBM Systems Journal
Intensive Data Management in Parallel Systems: A Survey
Distributed and Parallel Databases
Polar: an architecture for a parallel ODMG compliant object database
Proceedings of the ninth international conference on Information and knowledge management
GeMDA: A Multidimensional Data Partitioning Technique for Multiprocessor Database Systems
Distributed and Parallel Databases
Systems support for scalable data mining
ACM SIGKDD Explorations Newsletter - Special issue on “Scalable data mining algorithms”
ACM Transactions on Computer Systems (TOCS)
Programming and Computing Software
Load Balancing for Parallel Query Execution on NUMA Multiprocessors
Distributed and Parallel Databases
Benchmarking the DBS3 Parallel Query Optimizer
IEEE Parallel & Distributed Technology: Systems & Technology
Parallel query processing with zigzag trees
The VLDB Journal — The International Journal on Very Large Data Bases - Parallelism in database systems
Considering data skew factor in multi-way join query optimization for parallel execution
The VLDB Journal — The International Journal on Very Large Data Bases - Parallelism in database systems
A FAD for Data Intensive Applications
IEEE Transactions on Knowledge and Data Engineering
PRISMA/DB: A Parallel, Main Memory Relational DBMS
IEEE Transactions on Knowledge and Data Engineering
Large Join Optimization on a Hypercube Multiprocessor
IEEE Transactions on Knowledge and Data Engineering
Site and Query Scheduling Policies in Multicomputer Database Systems
IEEE Transactions on Knowledge and Data Engineering
Performance Analysis of Affinity Clustering on Transaction Processing Coupling Architecture
IEEE Transactions on Knowledge and Data Engineering
Applying Segmented Right-Deep Trees to Pipelining Multiple Hash Joins
IEEE Transactions on Knowledge and Data Engineering
Dynamic Load Balancing in Multicomputer Database Systems Using Partition Tuning
IEEE Transactions on Knowledge and Data Engineering
Utilizing Page-Level Join Index for Optimization in Parallel Join Execution
IEEE Transactions on Knowledge and Data Engineering
Optimization of Parallel Execution for Multi-Join Queries
IEEE Transactions on Knowledge and Data Engineering
Scalability Analysis of Declustering Methods for Multidimensional Range Queries
IEEE Transactions on Knowledge and Data Engineering
A Parallel Sort Merge Join Algorithm for Managing Data Skew
IEEE Transactions on Parallel and Distributed Systems
Control Versus Data Flow in Parallel Database Machines
IEEE Transactions on Parallel and Distributed Systems
Performance Evaluation of Transaction Processing Coupling Architectures for Handling System Dynamics
IEEE Transactions on Parallel and Distributed Systems
MAGIC: A Multiattribute Declustering Mechanism for Multiprocessor Database Machines
IEEE Transactions on Parallel and Distributed Systems
Distributed Load Balancing for Parallel Main Memory Hash Join
IEEE Transactions on Parallel and Distributed Systems
Encapsulation of Parallelism and Architecture-Independence in Extensible Database Query Execution
IEEE Transactions on Software Engineering
Join and Data Redistribution Algorithms for Hypercubes
IEEE Transactions on Knowledge and Data Engineering
Efficient OLAP query processing in distributed data warehouses
Information Systems - Special issue: Best papers from EDBT 2002
Efficient OLAP Query Processing in Distributed Data Warehouses
EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
A Requirements Analysis for Parallel KDD Systems
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Performance Studies of Shared-Nothing Parallel Transaction Processing Systems
PaCT '999 Proceedings of the 5th International Conference on Parallel Computing Technologies
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Optimization of Multi-Way Join Queries for Parallel Execution
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Using Segmented Right-Deep Trees for the Execution of Pipelined Hash Joins
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Parallelism in a Main-Memory DBMS: The Performance of PRISMA/DB
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Applying Hash Filters to Improving the Execution of Bushy Trees
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Managing Memory to Meet Multiclass Workload Response Time Goals
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Performance of Data-Parallel Spatial Operations
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
A Non-Uniform Data Fragmentation Strategy for Parallel Main-Menory Database Systems
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Coloring Away Communication in Parallel Query Optimization
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Managing Intra-operator Parallelism in Parallel Database Systems
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Estimation of Query-Result Distribution and its Application in Parallel-Join Load Balancing
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Dynamic Load Balancing in Hierarchical Parallel Database Systems
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
OLAP Query Evaluation in a Database Cluster: A Performance Study on Intra-Query Parallelism
ADBIS '02 Proceedings of the 6th East European Conference on Advances in Databases and Information Systems
Adding a Performance-Oriented Perspective to Data Warehouse Design
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
DEXA '00 Proceedings of the 11th International Conference on Database and Expert Systems Applications
Data placement in shared-nothing parallel database systems
The VLDB Journal — The International Journal on Very Large Data Bases
On applying hash filters to improving the execution of multi-join queries
The VLDB Journal — The International Journal on Very Large Data Bases
Data and knowledge in database systems: parallel databases
Handbook of data mining and knowledge discovery
Reducing the virtual memory overhead of swizzling
IWOOOS '96 Proceedings of the 5th International Workshop on Object Orientation in Operating Systems (IWOOOS '96)
The Design, Implementation and Evaluation of an ODMG Compliant, Parallel Object Database Server
Distributed and Parallel Databases
Survey of Architectures of Parallel Database Systems
Programming and Computing Software
Hash-based labeling techniques for storage scaling
The VLDB Journal — The International Journal on Very Large Data Bases
Parallel querying with non-dedicated computers
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Revisiting pipelined parallelism in multi-join query processing
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Reactive provisioning of backend databases in shared dynamic content server clusters
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
InfiniteDB: a pc-cluster based parallel massive database management system
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Conflict-aware scheduling for dynamic content applications
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Distributed data-parallel computing using a high-level programming language
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
MapReduce and parallel DBMSs: friends or foes?
Communications of the ACM - Amir Pnueli: Ahead of His Time
Parallel OLAP with the Sidera server
Future Generation Computer Systems
Adaptively parallelizing distributed range queries
Proceedings of the VLDB Endowment
Sidera: a cluster-based server for online analytical processing
OTM'07 Proceedings of the 2007 OTM confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part II
Comet: batched stream processing for data intensive distributed computing
Proceedings of the 1st ACM symposium on Cloud computing
Low overhead concurrency control for partitioned main memory databases
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Adaptive query execution for data management in the cloud
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Comparing Hadoop and Fat-Btree based access method for small file I/O applications
WAIM'10 Proceedings of the 11th international conference on Web-age information management
IDEAS'97 Proceedings of the 1997 international conference on International database engineering and applications symposium
Large-scale incremental processing using distributed transactions and notifications
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Parallelizing join computations of SPARQL queries for large semantic web databases
Proceedings of the 2011 ACM Symposium on Applied Computing
Accelerating large semantic web databases by parallel join computations of SPARQL queries
ACM SIGAPP Applied Computing Review
Granola: low-overhead distributed transaction coordination
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
ACM SIGOPS 24th Symposium on Operating Systems Principles
Transaction chains: achieving serializability with low latency in geo-distributed storage systems
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
Hi-index | 0.02 |
Bubba is a highly parallel computer system for data-intensive applications. The basis of the Bubba design is a scalable shared-nothing architecture which can scale up to thousands of nodes. Data are declustered across the nodes (i.e. horizontally partitioned via hashing or range partitioning) and operations are executed at those nodes containing relevant data. In this way, parallelism can be exploited within individual transactions as well as among multiple concurrent transactions to improve throughput and response times for data-intensive applications. The current Bubba prototype runs on a commercial 40-node multicomputer and includes a parallelizing compiler, distributed transaction management, object management, and a customized version of Unix. The current prototype is described and the major design decisions that went into its construction are discussed. The lessons learned from this prototype and its predecessors are presented.