MapReduce advantages over parallel databases include storage-system independence and fine-grain fault tolerance for large jobs.