Efficient dispersal of information for security, load balancing, and fault tolerance
Journal of the ACM (JACM)
Scans as Primitive Parallel Operations
IEEE Transactions on Computers
High-performance sorting on networks of workstations
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Cluster-based scalable network services
Proceedings of the sixteenth ACM symposium on Operating systems principles
Journal of the ACM (JACM)
Systematic Efficient Parallelization of Scan and Other List Homomorphisms
Euro-Par '96 Proceedings of the Second International Euro-Par Conference on Parallel Processing-Volume II
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Diamond: A Storage Architecture for Early Discard in Interactive Search
FAST '04 Proceedings of the 3rd USENIX Conference on File and Storage Technologies
Explicit control in a batch-aware distributed file system
NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation - Volume 6
Evaluating MapReduce for Multi-core and Multiprocessor Systems
HPCA '07 Proceedings of the 2007 IEEE 13th International Symposium on High Performance Computer Architecture
Exploring a digital library through key ideas
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Generating links by mining quotations
Proceedings of the nineteenth ACM conference on Hypertext and hypermedia
Characterizing botnets from email spam records
LEET'08 Proceedings of the 1st Usenix Workshop on Large-Scale Exploits and Emergent Threats
A domain-specific language for parallel and grid computing
Proceedings of the 2008 AOSD workshop on Domain-specific aspect languages
Proceedings of the 10th international conference on Electronic commerce
A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Compute and storage clouds using wide area high performance networks
Future Generation Computer Systems
Mars: a MapReduce framework on graphics processors
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Dynamic user-defined similarity searching in semi-structured text retrieval
Proceedings of the 3rd international conference on Scalable information systems
Minimum-cost delegation in service composition
Theoretical Computer Science
Communications of the ACM - Being Human in the Digital Age
SIGPLAN programming language curriculum workshop: Workshop report summary
ACM SIGPLAN Notices
Programming languages in a liberal arts education
ACM SIGPLAN Notices
An aspect-oriented approach to the undergraduate programming language curriculum
ACM SIGPLAN Notices
Proceedings of the 4th workshop on Declarative aspects of multicore programming
Queue - Game Development
Criteria to Compare Cloud Computing with Current Database Technology
IWSM/Metrikon/Mensura '08 Proceedings of the International Conferences on Software Process and Product Measurement
Firefox (In)security update dynamics exposed
ACM SIGCOMM Computer Communication Review
A model for fast web mining prototyping
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Communications of the ACM - Security in the Browser
Teaching large scale data processing: the five-week course and two years' experiences
SCE '08 Proceedings of the 1st ACM Summit on Computing Education in China
Improving the responsiveness of internet services with automatic cache placement
Proceedings of the 4th ACM European conference on Computer systems
Estimating the impressionrank of web pages
Proceedings of the 18th international conference on World wide web
Collaborative filtering for orkut communities: discovery of user latent behavior
Proceedings of the 18th international conference on World wide web
Adaptive workload allocation in query processing in autonomous heterogeneous environments
Distributed and Parallel Databases
Distributed Data Mining Tasks and Patterns as Services
Euro-Par 2008 Workshops - Parallel Processing
SLIPstream: scalable low-latency interactive perception on streaming data
Proceedings of the 18th international workshop on Network and operating systems support for digital audio and video
Resource co-allocation for large-scale distributed environments
Proceedings of the 18th ACM international symposium on High performance distributed computing
Using realistic simulation for performance analysis of mapreduce setups
Proceedings of the 1st ACM workshop on Large-Scale system and application performance
Large-scale behavioral targeting
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
TANGENT: a novel, 'Surprise me', recommendation algorithm
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Data Parallel Bin-Based Indexing for Answering Queries on Multi-core Architectures
SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
What the parallel-processing community has (failed) to offer the multi/many-core generation
Journal of Parallel and Distributed Computing
Online Risk Analytics on the Cloud
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Brief announcement: PUSH, a DISC shell
Proceedings of the 28th ACM symposium on Principles of distributed computing
Understanding TCP incast throughput collapse in datacenter networks
Proceedings of the 1st ACM workshop on Research on enterprise networking
Towards Efficient MapReduce Using MPI
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Evaluating SPLASH-2 Applications Using MapReduce
APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies
Implementing Parallel Google Map-Reduce in Eden
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
A Vision for Next Generation Query Processors and an Associated Research Agenda
Globe '09 Proceedings of the 2nd International Conference on Data Management in Grid and Peer-to-Peer Systems
Distributed Algorithm for Computing Formal Concepts Using Map-Reduce Framework
IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Coupling semi-supervised learning of categories and relations
SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Dynamic load balancing for I/O-intensive applications on clusters
ACM Transactions on Storage (TOS)
Quincy: fair scheduling for distributed computing clusters
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Large-scale multimedia semantic concept modeling using robust subspace bagging and MapReduce
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Tour the world: a technical demonstration of a web-scale landmark recognition engine
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Low-cost management of inverted files for online full-text search
Proceedings of the 18th ACM conference on Information and knowledge management
Practical lessons of data mining at Yahoo!
Proceedings of the 18th ACM conference on Information and knowledge management
Stochastic gradient boosted distributed decision trees
Proceedings of the 18th ACM conference on Information and knowledge management
Exploring many task computing in scientific workflows
Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers
An efficient multi-dimensional index for cloud data management
Proceedings of the first international workshop on Cloud data management
Flexible procurement of services with uncertain durations using redundancy
IJCAI'09 Proceedings of the 21st international joint conference on Artificial intelligence
Mining document collections to facilitate accurate approximate entity matching
Proceedings of the VLDB Endowment
Distributed online aggregations
Proceedings of the VLDB Endowment
A Privacy Manager for Cloud Computing
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Enterprise Cloud Architecture for Chinese Ministry of Railway
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Towards a Theory of Universally Composable Cloud Computing
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Parallel K-Means Clustering Based on MapReduce
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Distributed Scheduling Extension on Hadoop
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
A Skeletal Parallel Framework with Fusion Optimizer for GPGPU Programming
APLAS '09 Proceedings of the 7th Asian Symposium on Programming Languages and Systems
Practically Applicable Formal Methods
SOFSEM '10 Proceedings of the 36th Conference on Current Trends in Theory and Practice of Computer Science
High-performance high-volume layered corpora annotation
ACL-IJCNLP '09 Proceedings of the Third Linguistic Annotation Workshop
Towards an Algebraic foundation for business planning
Proceedings of the 2009 EDBT/ICDT Workshops
What can visual content analysis do for text based image search?
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Web-scale distributional similarity and entity set expansion
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2
Analysis of hyperspectral data with diffusion maps and fuzzy ART
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
ISI'09 Proceedings of the 2009 IEEE international conference on Intelligence and security informatics
Mixing Hadoop and HPC workloads on parallel filesystems
Proceedings of the 4th Annual Workshop on Petascale Data Storage
Coupled semi-supervised learning for information extraction
Proceedings of the third ACM international conference on Web search and data mining
Exploiting multi-level parallelism for low-latency activity recognition in streaming video
MMSys '10 Proceedings of the first annual ACM SIGMM conference on Multimedia systems
Clouds at the crossroads: research perspectives
Crossroads - Plugging Into the Cloud
Multicore education: pieces of the parallel puzzle
Proceedings of the 41st ACM technical symposium on Computer science education
Accelerating SQL database operations on a GPU with CUDA
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units
Optimizing joins in a map-reduce environment
Proceedings of the 13th International Conference on Extending Database Technology
On the energy (in)efficiency of Hadoop clusters
ACM SIGOPS Operating Systems Review
Controlling your TV with gestures
Proceedings of the international conference on Multimedia information retrieval
Measuring the user experience on a large scale: user-centered metrics for web applications
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Caching and Materialization for Web Databases
Foundations and Trends in Databases
Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling
Proceedings of the 5th European conference on Computer systems
Mining advertiser-specific user behavior using adfactors
Proceedings of the 19th international conference on World wide web
Proceedings of the 19th international conference on World wide web
Cloud-TM: harnessing the cloud with distributed transactional memories
ACM SIGOPS Operating Systems Review
A unified execution model for cloud computing
ACM SIGOPS Operating Systems Review
Cassandra: a decentralized structured storage system
ACM SIGOPS Operating Systems Review
Distributed indexing of web scale datasets for the cloud
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud
Beyond online aggregation: parallel and incremental data mining with online Map-Reduce
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud
A heterogeneous parallel system running Open MPI on a broadband network of embedded set-top devices
Proceedings of the 7th ACM international conference on Computing frontiers
Introducing Scalileo: a Java based scaling framework
Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking
Proceedings of the 11th International Conference on Information Integration and Web-based Applications & Services
FlumeJava: easy, efficient data-parallel pipelines
PLDI '10 Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
Efficient parallel set-similarity joins using MapReduce
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
PeerWatch: a fault detection and diagnosis tool for virtualized consolidation systems
Proceedings of the 7th international conference on Autonomic computing
Parallelizing XML data-streaming workflows via MapReduce
Journal of Computer and System Sciences
Machine models for query processing
ACM SIGMOD Record
The impact of management operations on the virtualized datacenter
Proceedings of the 37th annual international symposium on Computer architecture
Centrality metric for dynamic networks
Proceedings of the Eighth Workshop on Mining and Learning with Graphs
Malstone: towards a benchmark for analytics on large data clouds
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Toward visual analysis of ensemble data sets
Proceedings of the 2009 Workshop on Ultrascale Visualization
A data placement strategy in scientific cloud workflows
Future Generation Computer Systems
An Analysis of Traces from a Production MapReduce Cluster
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
MOON: MapReduce On Opportunistic eNvironments
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
XCo: explicit coordination to prevent network fabric congestion in cloud computing cluster platforms
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
An overview of the Open Science Data Cloud
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Exploring application and infrastructure adaptation on hybrid grid-cloud infrastructure
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
AzureBlast: a case study of developing science applications on the cloud
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Cloud computing paradigms for pleasingly parallel biomedical applications
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Data parallelism in bioinformatics workflows using Hydra
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Twister: a runtime for iterative MapReduce
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Parallelizing multiple group-by query in share-nothing environment: a MapReduce study case
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Tiled-MapReduce: optimizing resource usages of data-parallel applications on multicore with tiling
Proceedings of the 19th international conference on Parallel architectures and compilation techniques
Accelerating parallel analysis of scientific simulation data via Zazen
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
New abstractions for data parallel programming
HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Airavat: security and privacy for MapReduce
NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
Improving MapReduce performance in heterogeneous environments
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Corey: an operating system for many cores
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Behavioral Targeting: The Art of Scaling Up Simple Algorithms
ACM Transactions on Knowledge Discovery from Data (TKDD)
An experience report on scaling tools for mining software repositories using MapReduce
Proceedings of the IEEE/ACM international conference on Automated software engineering
Query processing in a DBMS for cluster systems
Programming and Computer Software
Three challenges in data mining
Frontiers of Computer Science in China
From frequency to meaning: vector space models of semantics
Journal of Artificial Intelligence Research
Real-life performance of metric searching
SIGSPATIAL Special
Proceedings of the First International Workshop on Data Dissemination for Large Scale Complex Critical Infrastructures
Gossamer: a lightweight programming framework for multicore machines
HotPar'10 Proceedings of the 2nd USENIX conference on Hot topics in parallelism
Spark: cluster computing with working sets
HotCloud'10 Proceedings of the 2nd USENIX conference on Hot topics in cloud computing
Hybrid bulk synchronous parallelism library for clustered smp architectures
Proceedings of the fourth international workshop on High-level parallel programming and applications
Distributed indexing for semantic search
Proceedings of the 3rd International Semantic Search Workshop
Panel: designing the next educational programming language
Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications
A MapReduce approach to Gi*(d) spatial statistic
Proceedings of the ACM SIGSPATIAL International Workshop on High Performance and Distributed Geographic Information Systems
Spatial scene similarity assessment on Hadoop
Proceedings of the ACM SIGSPATIAL International Workshop on High Performance and Distributed Geographic Information Systems
Wisdom of the ages: toward delivering the children's web with the link-based agerank algorithm
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Massive structured data management solution
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
ESQP: an efficient SQL query processing for cloud data management
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Comparing SQL and MapReduce to compute Naive Bayes in a single table scan
CloudDB '10 Proceedings of the second international workshop on Cloud data management
Benchmarking cloud-based data management systems
CloudDB '10 Proceedings of the second international workshop on Cloud data management
A model of computation for MapReduce
SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
Towards extensible automatic image annotation with the bag-of-words approach
Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval
A marketplace for cloud resources
EMSOFT '10 Proceedings of the tenth ACM international conference on Embedded software
Automated web service query service
International Journal of Web and Grid Services
Network traffic characteristics of data centers in the wild
IMC '10 Proceedings of the 10th ACM SIGCOMM conference on Internet measurement
High throughput data-compression for cloud storage
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Merging file systems and data bases to fit the grid
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Multidimensional arrays for warehousing data on clouds
Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Query evaluation techniques for cluster database systems
ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
Characterising effective resource analyses for parallel and distributed coordination
FOPARA'09 Proceedings of the First international conference on Foundational and practical aspects of resource analysis
Parallel evolutionary approach of compaction problem using mapreduce
PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part II
Adaptive conflict unit size for distributed optimistic synchronization
EuroPar'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part I
Universal connection architecture for interactive applications to achieve distributed computing
Journal of Network and Computer Applications
A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers
Software—Practice & Experience - Focus on Selected PhD Literature Reviews in the Practical Aspects of Software Technology
Resource allocation across multiple cloud data centres
Proceedings of the 8th International Workshop on Middleware for Grids, Clouds and e-Science
A middleware for parallel processing of large graphs
Proceedings of the 8th International Workshop on Middleware for Grids, Clouds and e-Science
Private searching on MapReduce
TrustBus'10 Proceedings of the 7th international conference on Trust, privacy and security in digital business
PH2: an hadoop-based framework for mining structural properties from the PDB database
SAICSIT '10 Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists
BlobSeer: Next-generation data management for large scale infrastructures
Journal of Parallel and Distributed Computing
From a stream of relational queries to distributed stream processing
Proceedings of the VLDB Endowment
An analysis of Linux scalability to many cores
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Nectar: automatic management of data and computation in datacenters
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Chukwa: a system for reliable large-scale log collection
LISA'10 Proceedings of the 24th international conference on Large installation system administration
Parallel K-means clustering of remote sensing images based on mapreduce
WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
Parallel accessing massive NetCDF data based on mapreduce
WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
A graphical representation for identifier structure in logs
SLAML'10 Proceedings of the 2010 workshop on Managing systems via log analysis and machine learning techniques
Optimizing data analysis with a semi-structured time series database
SLAML'10 Proceedings of the 2010 workshop on Managing systems via log analysis and machine learning techniques
The high-activity parallel implementation of data preprocessing based on MapReduce
RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
Attribute reduction for massive data based on rough set theory and MapReduce
RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
OOLAM: an opinion oriented link analysis model for influence persona discovery
Proceedings of the fourth ACM international conference on Web search and data mining
Query suggestion for E-commerce sites
Proceedings of the fourth ACM international conference on Web search and data mining
Comparing the usability of library vs. language approaches to task parallelism
Evaluation and Usability of Programming Languages and Tools
Semantic analysis and retrieval in personal and social photo collections
Multimedia Tools and Applications
Proceedings of the 9th Annual Workshop on Network and Systems Support for Games
Social Services Computing: Concepts, Research Challenges, and Directions
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
Load and storage balanced posting file partitioning for parallel information retrieval
Journal of Systems and Software
On Two-Dimensional Sparse Matrix Partitioning: Models, Methods, and a Recipe
SIAM Journal on Scientific Computing
Efficient k-nearest neighbor graph construction for generic similarity measures
Proceedings of the 20th international conference on World wide web
CnC-CUDA: declarative programming for GPUs
LCPC'10 Proceedings of the 23rd international conference on Languages and compilers for parallel computing
Scheduling large jobs by abstraction refinement
Proceedings of the sixth conference on Computer systems
Optimizing intermediate data management in MapReduce computations
Proceedings of the First International Workshop on Cloud Computing Platforms
Variable-sized map and locality-aware reduce on public-resource grids
Future Generation Computer Systems
Wireless link scheduling for data center networks
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
Architectural Requirements for Cloud Computing Systems: An Enterprise Cloud Approach
Journal of Grid Computing
CrowdForge: crowdsourcing complex work
CHI '11 Extended Abstracts on Human Factors in Computing Systems
A load-aware scheduler for MapReduce framework in heterogeneous cloud environments
Proceedings of the 2011 ACM Symposium on Applied Computing
A fast approach for parallel deduplication on multicore processors
Proceedings of the 2011 ACM Symposium on Applied Computing
Scalable clone detection using description logic
Proceedings of the 5th International Workshop on Software Clones
A cloud-enabled regional climate model evaluation system
Proceedings of the 2nd International Workshop on Software Engineering for Cloud Computing
Automatic performance debugging of SPMD-style parallel programs
Journal of Parallel and Distributed Computing
Column-oriented storage techniques for MapReduce
Proceedings of the VLDB Endowment
Social content matching in MapReduce
Proceedings of the VLDB Endowment
Brasil: basic resource aggregation system infrastructure layer
Proceedings of the 1st International Workshop on Runtime and Operating Systems for Supercomputers
Incremental graph pattern matching
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Parallelism and data movement characterization of contemporary application classes
Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
On scheduling in map-reduce and flow-shops
Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
Providing scalable database services on the cloud
WISE'10 Proceedings of the 11th international conference on Web information systems engineering
HotOS'13 Proceedings of the 13th USENIX conference on Hot topics in operating systems
Large scale visual-based event matching
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Garbage collection auto-tuning for Java mapreduce on multi-cores
Proceedings of the international symposium on Memory management
Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
A hierarchical framework for cross-domain MapReduce execution
Proceedings of the second international workshop on Emerging computational methods for the life sciences
Static type checking of Hadoop MapReduce programs
Proceedings of the second international workshop on MapReduce and its applications
Proceedings of the second international workshop on MapReduce and its applications
Enhancement of Xen's scheduler for MapReduce workloads
Proceedings of the 20th international symposium on High performance distributed computing
Adapting MapReduce for HPC environments
Proceedings of the 20th international symposium on High performance distributed computing
A distributed look-up architecture for text mining applications using mapreduce
Proceedings of the 20th international symposium on High performance distributed computing
A unified representation of web logs for mining applications
Information Retrieval
ARIA: automatic resource inference and allocation for mapreduce environments
Proceedings of the 8th ACM international conference on Autonomic computing
PigSPARQL: mapping SPARQL to Pig Latin
Proceedings of the International Workshop on Semantic Web Information Management
OpenTopography: a services oriented architecture for community access to LIDAR topography
Proceedings of the 2nd International Conference on Computing for Geospatial Research & Applications
Odessa: enabling interactive perception applications on mobile devices
MobiSys '11 Proceedings of the 9th international conference on Mobile systems, applications, and services
Architecture-based fault tolerance support for grid applications
Proceedings of the joint ACM SIGSOFT conference -- QoSA and ACM SIGSOFT symposium -- ISARCS on Quality of software architectures -- QoSA and architecting critical systems -- ISARCS
A paradigm comparison for collecting TV channel statistics from high-volume channel zap events
Proceedings of the 5th ACM international conference on Distributed event-based systems
Learning condensed feature representations from large unsupervised data sets for supervised learning
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Distributed tuning of machine learning algorithms using MapReduce Clusters
Proceedings of the Third Workshop on Large Scale Data Mining: Theory and Applications
ACM SIGMETRICS Performance Evaluation Review
Multi-layer graph-based semi-supervised learning for large-scale image datasets using mapreduce
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
An intermediate algebra for optimizing RDF graph pattern matching on MapReduce
ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part II
Cloud-based malware detection for evolving data streams
ACM Transactions on Management Information Systems (TMIS)
Detecting adversarial advertisements in the wild
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
FLEX: a slot allocation scheduling optimizer for MapReduce workloads
Proceedings of the ACM/IFIP/USENIX 11th International Conference on Middleware
Synoptic: studying logged behavior with inferred models
Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
Brown Dwarf: A fully-distributed, fault-tolerant data warehousing system
Journal of Parallel and Distributed Computing
Algorithms and mechanisms for procuring services with uncertain durations using redundancy
Artificial Intelligence
CloudFuice: a flexible cloud-based data integration system
ICWE'11 Proceedings of the 11th international conference on Web engineering
On the benefits of transparent compression for cost-effective cloud data storage
Transactions on large-scale data- and knowledge-centered systems III
Filtering harmful sentences based on three-word co-occurrence
Proceedings of the 8th Annual Collaboration, Electronic messaging, Anti-Abuse and Spam Conference
International Journal of Systems, Control and Communications
Rule-based distributed and agent systems
RuleML'2011 Proceedings of the 5th international conference on Rule-based reasoning, programming, and applications
Database foundations for scalable RDF processing
RW'11 Proceedings of the 7th international conference on Reasoning web: semantic technologies for the web of data
Application and evaluation of inductive reasoning methods for the semantic web and software analysis
RW'11 Proceedings of the 7th international conference on Reasoning web: semantic technologies for the web of data
Proceedings of the 4th ACM symposium on Haskell
Cache size in a cost model for heterogeneous skeletons
Proceedings of the fifth international workshop on High-level parallel programming and applications
Making standard ML a practical database programming language
Proceedings of the 16th ACM SIGPLAN international conference on Functional programming
ComputErl - erlang-based framework for many task computing
TFP'10 Proceedings of the 11th international conference on Trends in functional programming
NEFOS: rapid cache-aware range query processing with probabilistic guarantees
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Data integration over NoSQL stores using access path based mappings
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Data-driven modeling and analysis of online social networks
WAIM'11 Proceedings of the 12th international conference on Web-age information management
Green challenges to system software in data centers
Frontiers of Computer Science in China
RR'11 Proceedings of the 5th international conference on Web reasoning and rule systems
HORT: Hadoop online ray tracing with mapreduce
ACM SIGGRAPH 2011 Posters
Mining large distributed log data in near real time
SLAML '11 Managing Large-scale Systems via the Analysis of System Logs and the Application of Machine Learning Techniques
Modeling and synthesizing task placement constraints in Google compute clusters
Proceedings of the 2nd ACM Symposium on Cloud Computing
No one (cluster) size fits all: automatic cluster sizing for data-intensive analytics
Proceedings of the 2nd ACM Symposium on Cloud Computing
Utilizing green energy prediction to schedule mixed batch and service jobs in data centers
HotPower '11 Proceedings of the 4th Workshop on Power-Aware Computing and Systems
Evaluating the performance and scalability of mapreduce applications on X10
APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
Comparing high level mapreduce query languages
APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
Forty data communications research questions
ACM SIGCOMM Computer Communication Review
Sedic: privacy-aware data intensive computing on hybrid clouds
Proceedings of the 18th ACM conference on Computer and communications security
CrowdForge: crowdsourcing complex work
Proceedings of the 24th annual ACM symposium on User interface software and technology
The jabberwocky programming environment for structured social computing
Proceedings of the 24th annual ACM symposium on User interface software and technology
Strict serializability is harmless: a new architecture for enterprise applications
Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion
Enhancing query support in HBase Via An Extended Coprocessors Framework
ServiceWave'11 Proceedings of the 4th European conference on Towards a service-based internet
dipLODocus[RDF]: short and long-tail RDF analytics for massive webs of data
ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part I
Simplified parallel domain traversal
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Scalable hashing for shared memory supercomputers
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Auto-scaling to minimize cost and meet application deadlines in cloud workflows
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
A distributed look-up architecture for text mining applications using MapReduce
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
A linear-time approximation of the earth mover's distance
Proceedings of the 20th ACM international conference on Information and knowledge management
Efficient data distribution strategy for join query processing in the cloud
Proceedings of the third international workshop on Cloud data management
Authentication of range query results in mapreduce environments
Proceedings of the third international workshop on Cloud data management
Analytics over large-scale multidimensional data: the big data revolution!
Proceedings of the ACM 14th international workshop on Data Warehousing and OLAP
A Load-Driven Task Scheduler with Adaptive DSC for MapReduce
GREENCOM '11 Proceedings of the 2011 IEEE/ACM International Conference on Green Computing and Communications
Performance evaluation of MapReduce using full virtualisation on a departmental cloud
International Journal of Applied Mathematics and Computer Science - SPECIAL SECTION: Efficient Resource Management for Grid-Enabled Applications
Cloud computing: programming model and information exchange mechanism
Proceedings of the 2011 International Conference on Innovative Computing and Cloud Computing
An approach for processing large and non-uniform media objects on mapreduce-based clusters
ICADL'11 Proceedings of the 13th international conference on Asia-pacific digital libraries: for cultural heritage, knowledge dissemination, and future creation
SpotMPI: a framework for auction-based HPC computing using amazon spot instances
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part II
Using Coq in specification and program extraction of hadoop mapreduce applications
SEFM'11 Proceedings of the 9th international conference on Software engineering and formal methods
Understanding and improving the diagnostic workflow of MapReduce users
CHIMIT '11 Proceedings of the 5th ACM Symposium on Computer Human Interaction for Management of Information Technology
The Combinatorial BLAS: design, implementation, and applications
International Journal of High Performance Computing Applications
Program Ultra-Dispatcher for launching applications in a customization manner on cloud computing
Journal of Network and Computer Applications
MARIANE: MApReduce Implementation Adapted for HPC Environments
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Benchmarking MapReduce Implementations for Application Usage Scenarios
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Scalable and Distributed Processing of Scientific XML Data
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Embedded Processor Virtualization for Broadband Grid Computing
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Energy efficient scheduling of MapReduce workloads on heterogeneous clusters
Proceedings of the 2nd International Workshop on Green Computing Middleware
Prediction-based auto-scaling of scientific workflows
Proceedings of the 9th International Workshop on Middleware for Grids, Clouds and e-Science
A survey of emerging approaches to spam filtering
ACM Computing Surveys (CSUR)
Elastic complex event processing
Proceedings of the 8th Middleware Doctoral Symposium
An adaptive scheduling algorithm for dynamic heterogeneous Hadoop systems
Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research
ChuQL: processing XML with XQuery using Hadoop
Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research
Geometric overpass extraction from vector road data and DSMs
Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
SpSJoin: parallel spatial similarity joins
Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
High-performance processing of text queries with tunable pruned term and term pair indexes
ACM Transactions on Information Systems (TOIS)
Utilizing green energy prediction to schedule mixed batch and service jobs in data centers
ACM SIGOPS Operating Systems Review
Parallel data processing with MapReduce: a survey
ACM SIGMOD Record
NoSQL databases: a step to database scalability in web environment
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
Runtime model validation with parallel object constraint language
Proceedings of the 8th International Workshop on Model-Driven Engineering, Verification and Validation
More convenient more overhead: the performance evaluation of Hadoop streaming
Proceedings of the 2011 ACM Symposium on Research in Applied Computation
Distributed workflow-driven analysis of large-scale biological data using biokepler
Proceedings of the 2nd international workshop on Petascale data analytics: challenges and opportunities
Provenance for MapReduce-based data-intensive workflows
Proceedings of the 6th workshop on Workflows in support of large-scale science
Supporting dynamic parameter sweep in adaptive and user-steered workflow
Proceedings of the 6th workshop on Workflows in support of large-scale science
Descriptive matrix factorization for sustainability: Adopting the principle of opposites
Data Mining and Knowledge Discovery
Multi-pass sorted neighborhood blocking with MapReduce
Computer Science - Research and Development
Scientific data services: a high-performance I/O system with array semantics
Proceedings of the first annual workshop on High performance computing meets databases
Executing multiple group by query using mapreduce approach: implementation and optimization
GPC'10 Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing
A fully-protected large-scale email system built on map-reduce framework
GPC'10 Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing
CIRCUMFLEX: a scheduling optimizer for MapReduce workloads with shared scans
ACM SIGOPS Operating Systems Review
Mitigating the negative impact of preemption on heterogeneous MapReduce workloads
Proceedings of the 7th International Conference on Network and Services Management
Tarazu: optimizing MapReduce on heterogeneous clusters
ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Enhancing TCP throughput of highly available virtual machines via speculative communication
VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
Hierarchical link analysis for ranking web data
ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Experiences teaching MapReduce in the cloud
Proceedings of the 43rd ACM technical symposium on Computer Science Education
Social networking in developing regions
Proceedings of the Fifth International Conference on Information and Communication Technologies and Development
ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review
The Aneka platform and QoS-driven resource provisioning for elastic applications on hybrid Clouds
Future Generation Computer Systems
A proposal for user-defined reductions in OpenMP
IWOMP'10 Proceedings of the 6th international conference on Beyond Loop Level Parallelism in OpenMP: accelerators, Tasking and more
Meeting service level objectives of Pig programs
Proceedings of the 2nd International Workshop on Cloud Computing Platforms
Energy efficiency for large-scale MapReduce workloads with significant interactive analysis
Proceedings of the 7th ACM european conference on Computer Systems
Kineograph: taking the pulse of a fast-changing and connected world
Proceedings of the 7th ACM european conference on Computer Systems
Jockey: guaranteed job latency in data parallel clusters
Proceedings of the 7th ACM european conference on Computer Systems
HotCloud'11 Proceedings of the 3rd USENIX conference on Hot topics in cloud computing
The datacenter needs an operating system
HotCloud'11 Proceedings of the 3rd USENIX conference on Hot topics in cloud computing
CC'10/ETAPS'10 Proceedings of the 19th joint European conference on Theory and Practice of Software, international conference on Compiler Construction
ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Many-Core architecture oriented parallel algorithm design for computer animation
MIG'11 Proceedings of the 4th international conference on Motion in Games
Proceedings of the Seventh Annual Workshop on Cyber Security and Information Intelligence Research
A parallel method for computing rough set approximations
Information Sciences: an International Journal
Distributed parallel architecture for storing and processing large datasets
SEPADS'12/EDUCATION'12 Proceedings of the 11th WSEAS international conference on Software Engineering, Parallel and Distributed Systems, and proceedings of the 9th WSEAS international conference on Engineering Education
Fused state machines for fault tolerance in distributed systems
OPODIS'11 Proceedings of the 15th international conference on Principles of Distributed Systems
Adaptive parallelization of queries to data providing web service operations
Transactions on Large-Scale Data- and Knowledge-Centered Systems V
Abstract state machines for data-parallel computing
Conceptual Modelling and Its Theoretical Foundations
Matrix chain multiplication via multi-way join algorithms in MapReduce
Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
Foundations and Trends® in Machine Learning
Parallelizing an index generator for desktop search
ISCA'10 Proceedings of the 2010 international conference on Computer Architecture
Cluster computing, recursion and datalog
Datalog'10 Proceedings of the First international conference on Datalog Reloaded
The computing framework for physics analysis at LHCb
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
RAVEN --- boosting data analysis for the LHC experiments
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
Optimal trust mining and computing on keyed mapreduce
ESSoS'12 Proceedings of the 4th international conference on Engineering Secure Software and Systems
A new framework for join product skew
RED'10 Proceedings of the Third international conference on Resource Discovery
H2RDF: adaptive query processing on RDF data in the cloud
Proceedings of the 21st international conference companion on World Wide Web
Harnessing user library statistics for research evaluation and knowledge domain visualization
Proceedings of the 21st international conference companion on World Wide Web
Large scale microblog mining using distributed MB-LDA
Proceedings of the 21st international conference companion on World Wide Web
Confidant: protecting OSN data without locking it up
Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
Resource provisioning framework for mapreduce jobs with performance goals
Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
Sorting, searching, and simulation in the mapreduce framework
ISAAC'11 Proceedings of the 22nd international conference on Algorithms and Computation
iMapReduce: A Distributed Computing Framework for Iterative Computation
Journal of Grid Computing
Applying traffic merging to datacenter networks
Proceedings of the 3rd International Conference on Future Energy Systems: Where Energy, Computing and Communication Meet
Parallel programming: design of an overview class
Proceedings of the 2011 ACM SIGPLAN X10 Workshop
Managing and mining large graphs: systems and implementations
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Clydesdale: structured data processing on hadoop
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Camdoop: exploiting in-network aggregation for big data applications
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
The structure of online diffusion networks
Proceedings of the 13th ACM Conference on Electronic Commerce
SWIM '12 Proceedings of the 4th International Workshop on Semantic Web Information Management
Flexible and efficient distributed resolution of large entities
FoIKS'12 Proceedings of the 7th international conference on Foundations of Information and Knowledge Systems
Scalable sequence similarity search and join in main memory on multi-cores
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing - Volume 2
P2P-MapReduce: Parallel data processing in dynamic Cloud environments
Journal of Computer and System Sciences
Proceedings of the 27th Annual ACM Symposium on Applied Computing
Towards building large-scale distributed systems for twitter sentiment analysis
Proceedings of the 27th Annual ACM Symposium on Applied Computing
Clydesdale: structured data processing on MapReduce
Proceedings of the 15th International Conference on Extending Database Technology
Transitive closure and recursive Datalog implemented on clusters
Proceedings of the 15th International Conference on Extending Database Technology
Serendipity: enabling remote computing among intermittently connected mobile devices
Proceedings of the thirteenth ACM international symposium on Mobile Ad Hoc Networking and Computing
Mapping a data-flow programming model onto heterogeneous platforms
Proceedings of the 13th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, Tools and Theory for Embedded Systems
Delay tails in MapReduce scheduling
Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
Scalable subspace logistic regression models for high dimensional data
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Bizard: an online multi-dimensional data analysis visualization tool
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
ESOP'12 Proceedings of the 21st European conference on Programming Languages and Systems
Pool-Based distributed evolutionary algorithms using an object database
EvoApplications'12 Proceedings of the 2012 European conference on Applications of Evolutionary Computation
Distributed simulated annealing with mapreduce
EvoApplications'12 Proceedings of the 2012 European conference on Applications of Evolutionary Computation
Flex-GP: genetic programming on the cloud
EvoApplications'12 Proceedings of the 2012 European conference on Applications of Evolutionary Computation
Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques
Cooperative private searching in clouds
Journal of Parallel and Distributed Computing
Graph pattern matching revised for social network analysis
Proceedings of the 15th International Conference on Database Theory
MapReduce in MPI for Large-scale graph algorithms
Parallel Computing
Swift: A language for distributed parallel scripting
Parallel Computing
Investigation of data locality and fairness in MapReduce
Proceedings of the third international workshop on MapReduce and its Applications
Coupling scheduler for MapReduce/Hadoop
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
PonD: dynamic creation of HTC pool on demand using a decentralized resource discovery system
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Massively-parallel stream processing under QoS constraints with Nephele
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Data-driven fault tolerance for work stealing computations
Proceedings of the 26th ACM international conference on Supercomputing
Space-round tradeoffs for MapReduce computations
Proceedings of the 26th ACM international conference on Supercomputing
Adaptive heterogeneous language support within a cloud runtime
Future Generation Computer Systems
OPTIMIS: A holistic approach to cloud service provisioning
Future Generation Computer Systems
Future Generation Computer Systems
Towards efficient data search and subsetting of large-scale atmospheric datasets
Future Generation Computer Systems
Towards Trusted Services: Result Verification Schemes for MapReduce
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
MARLA: MapReduce for Heterogeneous Clusters
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Executing Data-Intensive Workloads in a Cloud
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Hierarchical MapReduce Programming Model and Scheduling Algorithms
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Optimizing Completion Time and Resource Provisioning of Pig Programs
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
On Managing Very Large Sensor-Network Data Using Bigtable
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
MapReduce Workload Modeling with Statistical Approach
Journal of Grid Computing
Trends in Trends in Functional Programming 1999/2000 versus 2007/2008
Higher-Order and Symbolic Computation
Elastic computing: A portable optimization framework for hybrid computers
Parallel Computing
Cost models for view materialization in the cloud
Proceedings of the 2012 Joint EDBT/ICDT Workshops
Proceedings of the 2012 Joint EDBT/ICDT Workshops
Memory-restricted latent semantic analysis to accumulate term-document co-occurrence events
Pattern Recognition Letters
Fast and accurate link prediction in social networking systems
Journal of Systems and Software
SofEA: a pool-based framework for evolutionary algorithms using CouchDB
Proceedings of the 14th annual conference companion on Genetic and evolutionary computation
Sketching and streaming algorithms for processing massive data
XRDS: Crossroads, The ACM Magazine for Students - Big Data
Component-based approach for programming and running scientific applications on grids and clouds
International Journal of High Performance Computing Applications
Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
Computational challenges in nanoparticle partition function calculation
Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond
Personalized news recommendation: a review and an experimental investigation
Journal of Computer Science and Technology - Special issue on Community Analysis and Information Recommendation
Making sense of healthcare benefits
Proceedings of the 34th International Conference on Software Engineering
Petri nets state space analysis in the cloud
Proceedings of the 34th International Conference on Software Engineering
Use of permutation prefixes for efficient and scalable approximate similarity search
Information Processing and Management: an International Journal
Explicit coordination to prevent congestion in data center networks
Cluster Computing
Reliable MapReduce computing on opportunistic resources
Cluster Computing
Reference deployment models for eliminating user concerns on cloud security
The Journal of Supercomputing
Enhancing privacy in cloud computing via policy-based obfuscation
The Journal of Supercomputing
DEMON: a local-first discovery method for overlapping communities
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
ComSoc: adaptive transfer of user behaviors over composite social network
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Factoring past exposure in display advertising targeting
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A panoramic view of 3g data/control-plane traffic: mobile device perspective
IFIP'12 Proceedings of the 11th international IFIP TC 6 conference on Networking - Volume Part I
Using R for iterative and incremental processing
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing
Big data platforms as a service: challenges and approach
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing
Predicting execution bottlenecks in map-reduce clusters
HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing
Toward efficient querying of compressed network payloads
USENIX ATC'12 Proceedings of the 2012 USENIX conference on Annual Technical Conference
Foundations and Trends in Information Retrieval
Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Cloudpress 2.0: a next generation news retrieval system on the cloud with a built-in summarizer
Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Evolutionary design of experiments using the MapReduce framework
Proceedings of the 2011 Summer Computer Simulation Conference
Oolong: asynchronous distributed applications made easy
Proceedings of the Asia-Pacific Workshop on Systems
Efficient multi-way theta-join processing using MapReduce
Proceedings of the VLDB Endowment
Opening the black boxes in data flow optimization
Proceedings of the VLDB Endowment
Performance guarantees for distributed reachability queries
Proceedings of the VLDB Endowment
Processing a trillion cells per mouse click
Proceedings of the VLDB Endowment
Parallel rough set based knowledge acquisition using MapReduce from big data
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
A parallel graph partitioning algorithm to speed up the large-scale distributed graph mining
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Accelerating Bayesian network parameter learning using Hadoop and MapReduce
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Parallel software architecture for experimental workflows in computational biology on clouds
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II
Proceedings of the WICSA/ECSA 2012 Companion Volume
Typing a core binary-field arithmetic in a light logic
FOPARA'11 Proceedings of the Second international conference on Foundational and Practical Aspects of Resource Analysis
Eden --- parallel functional programming with haskell
CEFP'11 Proceedings of the 4th Summer School conference on Central European Functional Programming School
Challenges for future platforms, services and networked applications
CAiSE'12 Proceedings of the 24th international conference on Advanced Information Systems Engineering
MRKDSBC: a distributed background modeling algorithm based on mapreduce
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Parallel decision tree with application to water quality data analysis
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
Haskell vs. f# vs. scala: a high-level language features and parallelism support comparison
Proceedings of the 1st ACM SIGPLAN workshop on Functional high-performance computing
Scalable similarity-based neighborhood methods with MapReduce
Proceedings of the sixth ACM conference on Recommender systems
Parallel ABox reasoning of EL ontologies
JIST'11 Proceedings of the 2011 joint international conference on The Semantic Web
Constructing virtual documents for ontology matching using mapreduce
JIST'11 Proceedings of the 2011 joint international conference on The Semantic Web
PQL: a purely-declarative java extension for parallel programming
ECOOP'12 Proceedings of the 26th European conference on Object-Oriented Programming
M3R: increased performance for in-memory Hadoop jobs
Proceedings of the VLDB Endowment
Using TPIE for processing massive data sets in C++
SIGSPATIAL Special
Parallel max-min ant system using mapreduce
ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part I
Analysis and detection of web spam by means of web content
IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Future trends in similarity searching
SISAP'12 Proceedings of the 5th international conference on Similarity Search and Applications
LBSNRank: personalized pagerank on location-based social networks
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Automated profiling and resource management of pig programs for meeting service level objectives
Proceedings of the 9th international conference on Autonomic computing
AROMA: automated resource allocation and configuration of mapreduce environment in the cloud
Proceedings of the 9th international conference on Autonomic computing
Dynamic energy-aware capacity provisioning for cloud computing environments
Proceedings of the 9th international conference on Autonomic computing
Automatic task slots assignment in Hadoop MapReduce
Proceedings of the 1st Workshop on Architectures and Systems for Big Data
Myriad: parallel data generation on shared-nothing architectures
Proceedings of the 1st Workshop on Architectures and Systems for Big Data
Proceedings of the 2012 workshop on Management of big data systems
Proceedings of the 3rd Annual ACM Web Science Conference
Graph-based lexicon expansion with sparsity-inducing penalties
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
An iterative MapReduce approach to frequent subgraph mining in biological datasets
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
MySQL to NoSQL: data modeling challenges in supporting scalability
Proceedings of the 3rd annual conference on Systems, programming, and applications: software for humanity
Enhancing genetic algorithms for dependent job scheduling in grid computing environments
The Journal of Supercomputing
A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
Journal of Grid Computing
On the optimization of schedules for MapReduce workloads in the presence of shared scans
The VLDB Journal — The International Journal on Very Large Data Bases
Oolong: asynchronous distributed applications made easy
APSys'12 Proceedings of the Third ACM SIGOPS Asia-Pacific conference on Systems
Internet-based Virtual Computing Environment: Beyond the data center as a computer
Future Generation Computer Systems
Mobile cloud computing: A survey
Future Generation Computer Systems
HSim: A MapReduce simulator in enabling Cloud Computing
Future Generation Computer Systems
Automatic generation of software pipelines for heterogeneous parallel systems
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
On distributed file tree walk of parallel file systems
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
SciQL: a query language for unified scientific data processing and management
Proceedings of the 5th Ph.D. workshop on Information and knowledge
Intent-aware temporal query modeling for keyword suggestion
Proceedings of the 5th Ph.D. workshop on Information and knowledge
A distributed index for efficient parallel top-k keyword search on massive graphs
Proceedings of the twelfth international workshop on Web information and data management
Large scale data analytics on clouds
Proceedings of the fourth international workshop on Cloud data management
Differentially private top-k query over MapReduce
Proceedings of the fourth international workshop on Cloud data management
Reading the web with learned syntactic-semantic inference rules
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Heterogeneity and dynamicity of clouds at scale: Google trace analysis
Proceedings of the Third ACM Symposium on Cloud Computing
Bridging the tenant-provider gap in cloud services
Proceedings of the Third ACM Symposium on Cloud Computing
Balancing reducer skew in MapReduce workloads using progressive sampling
Proceedings of the Third ACM Symposium on Cloud Computing
A case for dual stack virtualization: consolidating HPC and commodity applications in the cloud
Proceedings of the Third ACM Symposium on Cloud Computing
A community based vaccination strategy over mobile phone records
Proceedings of the Second ACM Workshop on Mobile Systems, Applications, and Services for HealthCare
PARMA: a parallel randomized algorithm for approximate association rules mining in MapReduce
Proceedings of the 21st ACM international conference on Information and knowledge management
An effective rule miner for instance matching in a web of data
Proceedings of the 21st ACM international conference on Information and knowledge management
HadoopXML: a suite for parallel processing of massive XML data with multiple twig pattern queries
Proceedings of the 21st ACM international conference on Information and knowledge management
A framework for readapting and running bioinformatics applications in the cloud
Proceedings of the 2012 ACM Research in Applied Computation Symposium
Efficient distributed parallel top-down computation of ROLAP data cube using mapreduce
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Enabling cloud interoperability with COMPSs
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Scheduling mapreduce jobs in HPC clusters
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
On-the-fly task execution for speeding up pipelined mapreduce
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Scalable distributed architecture for media transcoding
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Enhancing service-oriented computing with software mobility
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
A cloud architecture with an efficient scheduling technique
ICICA'12 Proceedings of the Third international conference on Information Computing and Applications
Cross-Language high similarity search using a conceptual thesaurus
CLEF'12 Proceedings of the Third international conference on Information Access Evaluation: multilinguality, multimodality, and visual analytics
Multi-core and many-core shared-memory parallel raycasting volume rendering optimization and tuning
International Journal of High Performance Computing Applications
Automated and transparent model fragmentation for persisting large models
MODELS'12 Proceedings of the 15th international conference on Model Driven Engineering Languages and Systems
VMR: volunteer MapReduce over the large scale internet
Proceedings of the 10th International Workshop on Middleware for Grids, Clouds and e-Science
Managing service performance in NoSQL distributed storage systems
Proceedings of the 7th Workshop on Middleware for Next Generation Internet Computing
Failure scenario as a service (FSaaS) for Hadoop clusters
Proceedings of the Workshop on Secure and Dependable Middleware for Cloud Monitoring and Management
An open-source toolkit for mining Wikipedia
Artificial Intelligence
Towards a virtual research environment for language and literature researchers
Future Generation Computer Systems
SubSift web services and workflows for profiling and comparing scientists and their published works
Future Generation Computer Systems
Future Generation Computer Systems
G-Hadoop: MapReduce across distributed data centers for data-intensive computing
Future Generation Computer Systems
Checking and handling inconsistency of DBpedia
WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Next challenges for adaptive learning systems
ACM SIGKDD Explorations Newsletter
Assessing MapReduce for Internet Computing: A Comparison of Hadoop and BitDew-MapReduce
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Data-Intensive Workload Consolidation for the Hadoop Distributed File System
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Minimizing Cost of Virtual Machines for Deadline-Constrained MapReduce Applications in the Cloud
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
VC-Migration: Live Migration of Virtual Clusters in the Cloud
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
BIDE-Based parallel mining of frequent closed sequences with mapreduce
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part II
A virtual machine consolidation framework for MapReduce enabled computing clouds
Proceedings of the 24th International Teletraffic Congress
Confidant: protecting OSN data without locking it up
Proceedings of the 12th International Middleware Conference
Resource provisioning framework for MapReduce jobs with performance goals
Proceedings of the 12th International Middleware Conference
Using clouds for MapReduce measurement assignments
ACM Transactions on Computing Education (TOCE)
MAPCloud: Mobile Applications on an Elastic and Scalable 2-Tier Cloud Architecture
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Adapting MPI to MapReduce PaaS Clouds: An Experiment in Cross-Paradigm Execution
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
On the Performance of Virtualized Infrastructures for Processing Realtime Streaming Data
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Detecting near-duplicate documents using sentence-level features and supervised learning
Expert Systems with Applications: An International Journal
MapReduce-Based data stream processing over large history data
ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
Computing scientometrics in large-scale academic search engines with mapreduce
WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
Lightweight semantics over web information systems content employing knowledge tags
ER'12 Proceedings of the 2012 international conference on Advances in Conceptual Modeling
Outsourcing encryption of attribute-based encryption with mapreduce
ICICS'12 Proceedings of the 14th international conference on Information and Communications Security
Towards an agent-based symbiotic architecture for autonomic management of virtualized data centers
Proceedings of the Winter Simulation Conference
Constructing a data accessing layer for in-memory data grid
Proceedings of the Fourth Asia-Pacific Symposium on Internetware
Expediting search trend detection via prediction of query counts
Proceedings of the sixth ACM international conference on Web search and data mining
On the performance of high dimensional data clustering and classification algorithms
Future Generation Computer Systems
Future Generation Computer Systems
Scalable parallel computing on clouds using Twister4Azure iterative MapReduce
Future Generation Computer Systems
Budget optimization for online campaigns with positive carryover effects
WINE'12 Proceedings of the 8th international conference on Internet and Network Economics
Chinese medicine formula network analysis for core herbal discovery
BI'12 Proceedings of the 2012 international conference on Brain Informatics
Network-Based inference algorithm on hadoop
ISMIS'12 Proceedings of the 20th international conference on Foundations of Intelligent Systems
Applying mapreduce framework to peer-to-peer computing applications
ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part II
FGIT'12 Proceedings of the 4th international conference on Future Generation Information Technology
CrowdLang: a programming language for the systematic exploration of human computation systems
SocInfo'12 Proceedings of the 4th international conference on Social Informatics
Cloud Computing: Locally Sub-Clouds instead of Globally One Cloud
International Journal of Cloud Applications and Computing
International Journal of Agent Technologies and Systems
Grex: An efficient MapReduce framework for graphics processing units
Journal of Parallel and Distributed Computing
TigerQuoll: parallel event-based JavaScript
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming
CloudPack: exploiting workload flexibility through rational pricing
Proceedings of the 13th International Middleware Conference
X10-FT: transparent fault tolerance for APGAS language and runtime
Proceedings of the 2013 International Workshop on Programming Models and Applications for Multicores and Manycores
CAP: co-scheduling based on asymptotic profiling in CPU+GPU hybrid systems
Proceedings of the 2013 International Workshop on Programming Models and Applications for Multicores and Manycores
Turbine: a distributed-memory dataflow engine for extreme-scale many-task applications
Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
A self-adapted method for the categorization of social resources
Expert Systems with Applications: An International Journal
A more formal approach to "computer science: principles"
Proceedings of the 44th ACM technical symposium on Computer science education
Tiled-MapReduce: Efficient and Flexible MapReduce Processing on Multicore with Tiling
ACM Transactions on Architecture and Code Optimization (TACO)
The grid, the load and the gradient
Natural Computing: an international journal
Elastic and effective spatio-temporal query processing scheme on Hadoop
Proceedings of the 1st ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
Introduction to information visualisation
PROMISE'12 Proceedings of the 2012 international conference on Information Retrieval Meets Information Visualization
Producer-Consumer: the programming model for future many-core processors
ARCS'13 Proceedings of the 26th international conference on Architecture of Computing Systems
Journal of Computer and System Sciences
Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets
Journal of Visual Communication and Image Representation
Processing multi-way spatial joins on map-reduce
Proceedings of the 16th International Conference on Extending Database Technology
Sparkler: supporting large-scale matrix factorization
Proceedings of the 16th International Conference on Extending Database Technology
Scalable SPARQL querying processing on large RDF data in cloud computing environment
ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Exploiting and Evaluating MapReduce for Large-Scale Graph Mining
ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Exploring GPU architectures to accelerate semantic comparison for intention-based search
Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units
Email marketing and scalability using Hadoop
Proceedings of the 5th ACM COMPUTE Conference: Intelligent & scalable system technologies
A task routing approach to large-scale scheduling
Future Generation Computer Systems
Revisiting flow-based load balancing: Stateless path selection in data center networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Locating executable fragments with Concordia, a scalable, semantics-based architecture
Proceedings of the Eighth Annual Cyber Security and Information Intelligence Research Workshop
Component-based scalability for cloud applications
Proceedings of the 3rd International Workshop on Cloud Data and Platforms
Indexing and searching 100M images with map-reduce
Proceedings of the 3rd ACM international conference on multimedia retrieval
Speeding up model building for ECGA on CUDA platform
Proceedings of the 15th annual conference on Genetic and evolutionary computation
BigBench: towards an industry standard benchmark for big data analytics
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
High performance parallel evolutionary algorithm model based on MapReduce framework
International Journal of Computer Applications in Technology
EBM: an entropy-based model to infer social strength from spatiotemporal data
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Optimus: a dynamic rewriting framework for data-parallel execution plans
Proceedings of the 8th ACM European Conference on Computer Systems
Presto: distributed machine learning and graph processing with sparse matrices
Proceedings of the 8th ACM European Conference on Computer Systems
RadixVM: scalable address spaces for multithreaded applications
Proceedings of the 8th ACM European Conference on Computer Systems
Omega: flexible, scalable schedulers for large compute clusters
Proceedings of the 8th ACM European Conference on Computer Systems
Proceedings of the 16th International ACM Sigsoft symposium on Component-based software engineering
Evaluating MapReduce for profiling application traffic
Proceedings of the first edition workshop on High performance and programmable networking
Performance evaluation of a MongoDB and hadoop platform for scientific data analysis
Proceedings of the 4th ACM workshop on Scientific cloud computing
High performance risk aggregation: addressing the data processing challenge the hadoop mapreduce way
Proceedings of the 4th ACM workshop on Scientific cloud computing
Automatic tag recommendation for metadata annotation using probabilistic topic modeling
Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
Benchmarking approach for designing a mapreduce performance model
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
A throughput optimal algorithm for map task scheduling in mapreduce with data locality
ACM SIGMETRICS Performance Evaluation Review
Stream-monitoring with blockmon: convergence of network measurements and data analytics platforms
ACM SIGCOMM Computer Communication Review
Information Systems
Future Generation Computer Systems
Modeling I/O interference for data intensive distributed applications
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Building an on-demand virtual computing market in non-commercial communities
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Input data organization for batch processing in time window based computations
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Big graph mining: algorithms and discoveries
ACM SIGKDD Explorations Newsletter
Scalable RDF graph querying using cloud computing
Journal of Web Engineering
A cloud queuing service with strong consistency and high availability
IBM Journal of Research and Development
Scaling out the performance of service monitoring applications with blockmon
PAM'13 Proceedings of the 14th international conference on Passive and Active Measurement
Learning-Based interactive retrieval in large-scale multimedia collections
AMR'11 Proceedings of the 9th international conference on Adaptive Multimedia Retrieval: large-scale multimedia retrieval and evaluation
Split/merge: system support for elastic execution in virtual middleboxes
NSDI'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Investigating hybrid SSD FTL schemes for Hadoop workloads
Proceedings of the ACM International Conference on Computing Frontiers
Designing a database system for modern processing architectures
Proceedings of the 2013 SIGMOD/PODS Ph.D. symposium
HyMR: a hybrid MapReduce workflow system
Proceedings of the 3rd international workshop on Emerging computational methods for the life sciences
Large-scale bisimulation of RDF graphs
Proceedings of the Fifth Workshop on Semantic Web Information Management
A high-level framework for parallelizing legacy applications for multiple platforms
Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
Astronomical data processing in EXTASCID
Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Octopus: efficient data intensive computing on virtualized datacenters
Proceedings of the 6th International Systems and Storage Conference
Participatory networking: an API for application control of SDNs
Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
Proceedings of the 2013 ACM SIGSIM conference on Principles of advanced discrete simulation
Assisting developers of big data analytics applications when deploying on hadoop clouds
Proceedings of the 2013 International Conference on Software Engineering
Obtaining ground-truth software architectures
Proceedings of the 2013 International Conference on Software Engineering
A characteristic study on failures of production distributed data-parallel programs
Proceedings of the 2013 International Conference on Software Engineering
RPC automation: making legacy code relevant
Proceedings of the 8th International Symposium on Software Engineering for Adaptive and Self-Managing Systems
Scalable all-pairs similarity search in metric spaces
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Scalable processing of flexible graph pattern queries on the cloud
Proceedings of the 22nd international conference on World Wide Web companion
A survey of web archive search architectures
Proceedings of the 22nd international conference on World Wide Web companion
Graph-based malware distributors detection
Proceedings of the 22nd international conference on World Wide Web companion
Adaptive online scheduling in storm
Proceedings of the 7th ACM international conference on Distributed event-based systems
Job scheduling for optimizing data locality in Hadoop clusters
Proceedings of the 20th European MPI Users' Group Meeting
International Journal of Web and Grid Services
Scientific computing with Google App Engine
Future Generation Computer Systems
Performance comparison under failures of MPI and MapReduce: An analytical approach
Future Generation Computer Systems
Efficient social network data query processing on MapReduce
Proceedings of the 5th ACM workshop on HotPlanet
Automatic failure recovery for software-defined networks
Proceedings of the second ACM SIGCOMM workshop on Hot topics in software defined networking
Distributed community detection in web-scale networks
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Extracting Knowledge from Wikipedia Articles through Distributed Semantic Analysis
Proceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies
Autonomous, failure-resilient orchestration of distributed discrete event simulations
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
A case for MapReduce over the internet
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
Mammoth: autonomic data processing framework for scientific state-transition applications
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
Simulation process support for climate data analysis
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
Cloud MapReduce for particle filter-based data assimilation for wildfire spread simulation
Proceedings of the High Performance Computing Symposium
Multiple objective scheduling of HPC workloads through dynamic prioritization
Proceedings of the High Performance Computing Symposium
Direct out-of-memory distributed parallel frequent pattern mining
Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
The state of peer-to-peer network simulators
ACM Computing Surveys (CSUR)
Scalable and incremental software bug detection
Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering
Audience segment expansion using distributed in-database k-means clustering
Proceedings of the Seventh International Workshop on Data Mining for Online Advertising
Proceedings of the 2nd ACM SIGPLAN workshop on Functional high-performance computing
Channel reservation protocol for over-subscribed channels and destinations
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Optimization of cloud task processing with checkpoint-restart mechanism
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Algorithms for high-throughput disk-to-disk sorting
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Adaptive atomic capture of multiple molecules
Journal of Parallel and Distributed Computing
Actor scheduling for multicore hierarchical memory platforms
Proceedings of the twelfth ACM SIGPLAN workshop on Erlang
An efficient MapReduce algorithm for counting triangles in a very large graph
Proceedings of the 22nd ACM international conference on information & knowledge management
"All roads lead to Rome": optimistic recovery for distributed iterative data processing
Proceedings of the 22nd ACM international conference on information & knowledge management
Prolog programming with a map-reduce parallel construct
Proceedings of the 15th Symposium on Principles and Practice of Declarative Programming
Boosting energy efficiency with mirrored data block replication policy and energy scheduler
ACM SIGOPS Operating Systems Review
Banking on decoupling: budget-driven sustainability for HPC applications on auction-based clouds
ACM SIGOPS Operating Systems Review
Designing and testing a pool-based evolutionary algorithm
Natural Computing: an international journal
Distributed matrix factorization with mapreduce using a series of broadcast-joins
Proceedings of the 7th ACM conference on Recommender systems
Development of a virtualized supercomputing environment for genomic analysis
The Journal of Supercomputing
An online service-oriented performance profiling tool for cloud computing systems
Frontiers of Computer Science: Selected Publications from Chinese Universities
Mining search and browse logs for web search: A Survey
ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Efficient programming paradigm for video streaming processing on TILE64 platform
The Journal of Supercomputing
River trail: a path to parallelism in JavaScript
Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications
Enforcing Minimum Necessary Access in Healthcare Through Integrated Audit and Access Control
Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Can we analyze big data inside a DBMS?
Proceedings of the sixteenth international workshop on Data warehousing and OLAP
Monitoring social relationship among Twitter users by using NodeXL
Proceedings of the 2013 Research in Adaptive and Convergent Systems
SMINER - a platform for data mining based on service-oriented architecture
International Journal of Business Intelligence and Data Mining
Clustering on the cloud: reducing CLARA to MapReduce
Proceedings of the Second Nordic Symposium on Cloud Computing & Internet Technologies
A cloud computing based framework for general 2D and 3D cellular automata simulation
Advances in Engineering Software
Collaborative pseudo-relevance feedback
Expert Systems with Applications: An International Journal
Scalable multimedia content analysis on parallel platforms using python
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
ACM SIGOPS 24th Symposium on Operating Systems Principles
Data warehousing and OLAP over big data: current challenges and future research directions
Proceedings of the sixteenth international workshop on Data warehousing and OLAP
Performance Modeling and Optimization of Deadline-Driven Pig Programs
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Tango: distributed data structures over a shared log
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
The family of mapreduce and large-scale data processing systems
ACM Computing Surveys (CSUR)
A data-centric heuristic for Hadoop provisioning in the cloud
Proceedings of the 6th ACM India Computing Convention
Proceedings of the 4th annual Symposium on Cloud Computing
Apache Hadoop YARN: yet another resource negotiator
Proceedings of the 4th annual Symposium on Cloud Computing
Fast multi-fields query processing in bigtable based cloud systems
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
CG_Hadoop: computational geometry in MapReduce
Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Cloud-aware processing of MapReduce-based OLAP applications
AusPDC '13 Proceedings of the Eleventh Australasian Symposium on Parallel and Distributed Computing - Volume 140
Utility-Driven share scheduling algorithm in hadoop
ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
Extending modern PaaS clouds with BSP to execute legacy MPI applications
Proceedings of the 4th annual Symposium on Cloud Computing
XDB: a novel database architecture for data analytics as a service
Proceedings of the 4th annual Symposium on Cloud Computing
Does RDMA-based enhanced Hadoop MapReduce need a new performance model?
Proceedings of the 4th annual Symposium on Cloud Computing
BNCOD'13 Proceedings of the 29th British National conference on Big Data
Bisimulation reduction of big graphs on mapreduce
BNCOD'13 Proceedings of the 29th British National conference on Big Data
USTO.RE: a private cloud storage software system
ICWE'13 Proceedings of the 13th international conference on Web Engineering
Proceedings of the 17th International Database Engineering & Applications Symposium
SAAD, a content based Web Spam Analyzer and Detector
Journal of Systems and Software
Game-based scheduling algorithm to achieve optimize profit in mapreduce environment
ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories
MapReduce performance evaluation for knowledge-based recommendation of context-tagged photos
Proceedings of the 19th Brazilian symposium on Multimedia and the web
MR-runner: a modularized map-reduce job management tool
Proceedings of the 5th Asia-Pacific Symposium on Internetware
Joint optimization of overlapping phases in MapReduce
Performance Evaluation
CRUCIBLE: towards unified secure on- and off-line analytics at scale
DISCS-2013 Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems
BDMPI: conquering BigData with small clusters using MPI
DISCS-2013 Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems
Design of an active storage cluster file system for DAG workflows
DISCS-2013 Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems
When big data meets big smog: a big spatio-temporal data framework for China severe smog analysis
Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
Optimization strategies for A/B testing on HADOOP
Proceedings of the VLDB Endowment
MillWheel: fault-tolerant stream processing at internet scale
Proceedings of the VLDB Endowment
Online, asynchronous schema change in F1
Proceedings of the VLDB Endowment
Parallel graph processing on graphics processors made easy
Proceedings of the VLDB Endowment
Piggybacking on social networks
Proceedings of the VLDB Endowment
Scorpion: explaining away outliers in aggregate queries
Proceedings of the VLDB Endowment
Sharing data and work across concurrent analytical queries
Proceedings of the VLDB Endowment
Making queries tractable on big data with preprocessing: through the eyes of complexity theory
Proceedings of the VLDB Endowment
Hardware-oblivious parallelism for in-memory column-stores
Proceedings of the VLDB Endowment
Power-aware dynamic memory management on many-core platforms utilizing DVFS
ACM Transactions on Embedded Computing Systems (TECS) - Special Section on ESTIMedia'10
HiPCNA-PG '13 Proceedings of the 3rd International Workshop on High Performance Computing, Networking and Analytics for the Power Grid
Rapid processing of remote sensing images based on cloud computing
Future Generation Computer Systems
Taking a walk on the wild side: teaching cloud computing on distributed research testbeds
Proceedings of the 45th ACM technical symposium on Computer science education
Design and Evaluation of Lifelog Mashup Platform with NoSQL Database
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Monte-Carlo expectation maximization for decentralized POMDPs
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Forward perimeter search with controlled use of memory
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Towards privacy-preserving computing on distributed electronic health record data
Proceedings of the 2013 Middleware Doctoral Symposium
Data-parallel finite-state machines
Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
Efficient query evaluation on distributed graphs with Hadoop environment
Proceedings of the Fourth Symposium on Information and Communication Technology
Minimizing data transfers for regular reachability queries on distributed graphs
Proceedings of the Fourth Symposium on Information and Communication Technology
DBridges: Flexible floodless frame forwarding
Computer Networks: The International Journal of Computer and Telecommunications Networking
Identifying abnormal patterns in cellular communication flows
Proceedings of Principles, Systems and Applications on IP Telecommunications
Resilient X10: efficient failure-aware programming
Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming
Workload characteristics of DNA sequence analysis: from storage systems' perspective
Proceedings of the 6th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools
MapReduce "garbage" collection
CASCON '13 Proceedings of the 2013 Conference of the Center for Advanced Studies on Collaborative Research
FENNEL: streaming graph partitioning for massive scale graphs
Proceedings of the 7th ACM international conference on Web search and data mining
Instant loading for main memory databases
Proceedings of the VLDB Endowment
Parallel computation of skyline and reverse skyline queries using mapreduce
Proceedings of the VLDB Endowment
User behavior learning and transfer in composite social networks
ACM Transactions on Knowledge Discovery from Data (TKDD) - Casin special issue
A Large-scale Images Processing Model Based on Hadoop Platform
Proceedings of the Second International Conference on Innovative Computing and Cloud Computing
Scalable model of parallel computations for applications with intensive input-output
Journal of Computer and Systems Sciences International
Power consumption evaluation of all-optical data center networks
Cluster Computing
Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform
Journal of Grid Computing
Programming a Multicore Architecture without Coherency and Atomic Operations
Proceedings of Programming Models and Applications on Multicores and Manycores
Scalable power grid transient analysis via MOR-assisted time-domain simulations
Proceedings of the International Conference on Computer-Aided Design
Quantitative reactive modeling and verification
Computer Science - Research and Development
Achieving Accountable MapReduce in cloud computing
Future Generation Computer Systems
Partial-update dimensionality reduction for accumulating co-occurrence events
Pattern Recognition Letters
Security and privacy for storage and computation in cloud computing
Information Sciences: an International Journal
Generating storylines from sensor data
Pervasive and Mobile Computing
Text Categorization of Biomedical Data Sets Using Graph Kernels and a Controlled Vocabulary
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
VM consolidation: A real case based on OpenStack Cloud
Future Generation Computer Systems
Towards software performance engineering for multicore and manycore systems
ACM SIGMETRICS Performance Evaluation Review
Joint optimization of overlapping phases in MapReduce
ACM SIGMETRICS Performance Evaluation Review
Dimension independent similarity computation
The Journal of Machine Learning Research
Reduce and aggregate: similarity ranking in multi-categorical bipartite graphs
Proceedings of the 23rd international conference on World wide web
Continuous validation of load test suites
Proceedings of the 5th ACM/SPEC international conference on Performance engineering
Scalable hybrid stream and hadoop network analysis system
Proceedings of the 5th ACM/SPEC international conference on Performance engineering
SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters
Journal of Parallel and Distributed Computing
Speeding-up codon analysis on the cloud with local MapReduce aggregation
Information Sciences: an International Journal
Parallel labeling of massive XML data with MapReduce
The Journal of Supercomputing
A compound OpenMP/MPI program development toolkit for hybrid CPU/GPU clusters
The Journal of Supercomputing
Exploiting inter-operation parallelism for matrix chain multiplication using MapReduce
The Journal of Supercomputing
An improved partitioning mechanism for optimizing massive data analysis using MapReduce
The Journal of Supercomputing
Review: A survey on architectures and energy efficiency in Data Center Networks
Computer Communications
The SAMS: Smartphone Addiction Management System and Verification
Journal of Medical Systems
CPU+GPU scheduling with asymptotic profiling
Parallel Computing
X10-FT: Transparent fault tolerance for APGAS language and runtime
Parallel Computing
WHAD: Wikipedia historical attributes data
Language Resources and Evaluation
A MapReduce task scheduling algorithm for deadline constraints
Cluster Computing
Scalable and Real-Time Deep Packet Inspection
UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing
Experimental Study on the Energy Consumption in IaaS Cloud Environments
UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing
A Language Based Security Approach for Securing Map-Reduce Computations in the Cloud
UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing
NEWT - A Fault Tolerant BSP Framework on Hadoop YARN
UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing
International Journal of Approximate Reasoning
SpringFS: bridging agility and performance in elastic distributed storage
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
WOOster: a map-reduce based platform for graph mining
Proceedings of the 17th International Conference on Management of Data
Approaches to Distributed Execution of Scientific Workflows in Kepler
Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
Turbine: A Distributed-memory Dataflow Engine for High Performance Many-task Applications
Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
Development of an intelligent distributed news retrieval system
International Journal of Knowledge-based and Intelligent Engineering Systems
Energy and locality aware load balancing in cloud computing
Integrated Computer-Aided Engineering
Journal of High Speed Networks
WIKI::SCORE A collaborative environment for music transcription and publishing
Information Services and Use - 16th International Conference on Electronic Publishing - ELPUB 2012 - Social Shaping of Digital Publishing: Exploring the Interplay between Culture and Technology
Nephele streaming: stream processing under QoS constraints at scale
Cluster Computing
MapReduce framework energy adaptation via temperature awareness
Cluster Computing
Design of reliable virtual infrastructure with resource sharing
Computer Networks: The International Journal of Computer and Telecommunications Networking
Trends and outlook for the massive-scale analytics stack
IBM Journal of Research and Development
Scalable community detection in massive social networks using MapReduce
IBM Journal of Research and Development
GRASS: trimming stragglers in approximation analytics
NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation
MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the system easy to use: more than ten thousand distinct MapReduce programs have been implemented internally at Google over the past four years, and an average of one hundred thousand MapReduce jobs are executed on Google's clusters every day, processing a total of more than twenty petabytes of data per day.
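To make the division of labor concrete, the following is a minimal single-process sketch of the model in Python, using word counting as the task. It is not Google's implementation: the names `map_word_count`, `reduce_word_count`, and `run_mapreduce` are hypothetical, and the toy driver only mimics the grouping of intermediate values by key. The real runtime's parallelization, fault handling, and scheduling are omitted entirely.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Iterator, List, Tuple


def map_word_count(doc_name: str, text: str) -> Iterator[Tuple[str, int]]:
    """User-supplied map: emit an intermediate (word, 1) pair per word."""
    for word in text.split():
        yield word.lower(), 1


def reduce_word_count(word: str, counts: Iterable[int]) -> int:
    """User-supplied reduce: merge all values emitted for one key."""
    return sum(counts)


def run_mapreduce(
    inputs: Iterable[Tuple[str, str]],
    map_fn: Callable[[str, str], Iterator[Tuple[str, int]]],
    reduce_fn: Callable[[str, Iterable[int]], int],
) -> Dict[str, int]:
    """Toy sequential driver (an assumption for illustration only).

    A real runtime would run map and reduce tasks on many machines;
    here the grouping step is just an in-memory dictionary.
    """
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in inputs:
        for out_key, out_value in map_fn(key, value):
            groups[out_key].append(out_value)
    return {key: reduce_fn(key, values) for key, values in groups.items()}


docs = [("a.txt", "the quick fox"), ("b.txt", "the lazy dog the end")]
counts = run_mapreduce(docs, map_word_count, reduce_word_count)
```

The user writes only the two small functions; everything inside `run_mapreduce` stands in for what the abstract attributes to the underlying runtime system.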