The Fifth Elephant 2013

An Event on Big Data and Cloud Computing

RHadoop: Marrying analytics & large scale data processing

Submitted by Anand (@anandk) on Sunday, 2 June 2013

Section: Analytics and Visualization Technical level: Beginner

Abstract

  1. Why Hadoop is not analytics
  2. Why "Big Data" is not analytics
  3. Quick overview of performing analytics over a large data set (without RHadoop)
  4. Easing the pain with RHadoop: RHadoop tutorial session

Outline

This session will quickly segue from helping the audience realise the difference between "big data" & analytics into a hands-on about writing using R for performing analytics on a large dataset (using naive distributed computing) followed by a hands-on session on using RHadoop.

Requirements

  1. Familiarity with R would help though is not mandatory
  2. Some familiarity with statistics would help map jargon

Speaker bio

Anand Krishnaswamy is a developer at ThoughtWorks. His background spans filesystem development, storage management solution development, class library & compiler design & development & recently, web-app development. His interests in data analytics is self-developed. His other interests range from photography & cooking to writing & painting.

Comments

Login with Twitter or Google to leave a comment