Parallel R

  • Authors:
  • Q. Ethan McCallum;Stephen Weston

  • Affiliations:
  • -;-

  • Venue:
  • Parallel R
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Its tough to argue with R as a high-quality, cross-platform, open source statistical software productunless youre in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets. Youll learn the basics of Snow, Multicore, Parallel, and some Hadoop-related tools, including how to find them, how to use them, when they work well, and when they dont. With these packages, you can overcome Rs single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address Rs memory barrier.Snow: works well in a traditional cluster environment Multicore: popular for multiprocessor and multicore computers Parallel: part of the upcoming R 2.14.0 release R+Hadoop: provides low-level access to a popular form of cluster computing RHIPE: uses Hadoops power with Rs language and interactive shell Segue: lets you use Elastic MapReduce as a backend for lapply-style operations