UPDATED 10:16 EDT / MARCH 23 2012

NEWS

3 Tutorials on Using R with Hadoop

Jeffrey Breen of Atmosphere Research Group presented at tk how to use Apache Hadoop with the statistical programming language R using RHadoop. Hadoop has become practically synonymous with big data and R has become the language of choice for data scientists so it’s natural to want to use the two together.

Breen has made his presentations available on SlideShare and the code and configuration files available on Github.

The first tutorial explains how to install Hadoop on a local virtual machine to help you get familiar with Hadoop:

The second guides you through the process of setting up R and RStudio on an Amazon Web Services EC2 instance:

The final presentation demonstrates how to launch a Hadoop cluster on EC2 using Apache Whirr.


A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU