Parallel R


It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product—unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You’ll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don’t.

With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.

Snow: works well in a traditional cluster environment
Multicore: popular for multiprocessor and multicore computers
Parallel: part of the upcoming R 2.14.0 release
R+Hadoop: provides low-level access to a popular form of cluster computing
RHIPE: uses Hadoop’s power with R’s language and interactive shell
Segue: lets you use Elastic MapReduce as a backend for lapply-style operations

More info:

Publish date: Oct 21, 2011
Added to Scribd: Jan 15, 2013
Copyright: Traditional Copyright: All rights reserved
ISBN: 9781449320348
List Price: $19.99

Pages: 122
