3 Tutorials on Using R with Hadoop

3 Tutorials on Using R with Hadoop

Jeffrey Breen of Atmosphere Research Group presented at tk how to use Apache Hadoop with the statistical programming language R using RHadoop. Hadoop has become practically synonymous with big data and R has become the language of choice for data scientists so it’s natural to want to use the two together.

Breen has made his presentations available on SlideShare and the code and configuration files available on Github.

The first tutorial explains how to install Hadoop on a local virtual machine to help you get familiar with Hadoop:

The second guides you through the process of setting up R and RStudio on an Amazon Web Services EC2 instance:

The final presentation demonstrates how to launch a Hadoop cluster on EC2 using Apache Whirr.

RELATED:  How one company helps deploy data pipelines in minutes | #StructureConf

Klint Finley

Klint Finley is a Senior Writer at SiliconAngle. His specialties
include IT services, enterprise technology and software development.
Prior to SiliconAngle he was a writer for ReadWriteWeb. He's also a
former IT practicioner, and has written about technology for over a
decade. He can be contacted at angle@klintfinley.com.


Join our mailing list to receive the latest news and updates from our team.

Submit a Comment

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

Share This

Share This

Share this post with your friends!