Proceedings of the WICSA/ECSA 2012 Companion Volume
Toward scalable internet traffic measurement and analysis with Hadoop
ACM SIGCOMM Computer Communication Review
MRSG - A MapReduce simulator over SimGrid
Parallel Computing
Exploiting MapReduce and data compression for data-intensive applications
Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
RPC automation: making legacy code relevant
Proceedings of the 8th International Symposium on Software Engineering for Adaptive and Self-Managing Systems
Crowdsourcing MapReduce: JSMapReduce
Proceedings of the 22nd international conference on World Wide Web companion
Adaptive online scheduling in storm
Proceedings of the 7th ACM international conference on Distributed event-based systems
DynamicCloudSim: simulating heterogeneity in computational clouds
Proceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
An efficient MapReduce algorithm for counting triangles in a very large graph
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
The family of mapreduce and large-scale data processing systems
ACM Computing Surveys (CSUR)
A single-domain, representation-learning model for big data classification of network intrusion
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Semantics and provenance for processing element composition in dispel workflows
WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
Design of an active storage cluster file system for DAG workflows
DISCS-2013 Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems
Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
Introducing spatial context in comparative pricing and product search
Proceedings of the Fifth International Conference on Management of Emergent Digital EcoSystems
Integrating big data into the computing curricula
Proceedings of the 45th ACM technical symposium on Computer science education
A big data based data storage systems for rock burst experiment
International Journal of Wireless and Mobile Computing
An adaptable system for RGB-D based human body detection and pose estimation
Journal of Visual Communication and Image Representation
Scalable hybrid stream and hadoop network analysis system
Proceedings of the 5th ACM/SPEC international conference on Performance engineering
Benchmarking graph-processing platforms: a vision
Proceedings of the 5th ACM/SPEC international conference on Performance engineering
Speeding-up codon analysis on the cloud with local MapReduce aggregation
Information Sciences: an International Journal
Hi-index | 0.00 |
Ready to unlock the power of your data? With this comprehensive guide, youll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. Youll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This third edition covers recent changes to Hadoop, including material on the new MapReduce API, as well as MapReduce 2 and its more flexible execution model (YARN).Store large datasets with the Hadoop Distributed File System (HDFS) Run distributed computations with MapReduce Use Hadoops data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design, build, and administer a dedicated Hadoop clusteror run Hadoop in the cloud Load data from relational databases into HDFS, using Sqoop Perform large-scale data processing with the Pig query language Analyze datasets with Hive, Hadoops data warehousing system Take advantage of HBase for structured and semi-structured data, and ZooKeeper for building distributed systems