The Fifth Elephant 2013

An Event on Big Data and Cloud Computing

RHadoop: Marrying analytics & large scale data processing

Submitted by Anand (@anandk) on Sunday, 2 June 2013

videocam_off

Technical level

Beginner

Section

Analytics and Visualization

Status

Submitted

Vote on this proposal

Login to vote

Total votes:  +13

Objective

  1. Why Hadoop is not analytics
  2. Why "Big Data" is not analytics
  3. Quick overview of performing analytics over a large data set (without RHadoop)
  4. Easing the pain with RHadoop: RHadoop tutorial session

Description

This session will quickly segue from helping the audience realise the difference between "big data" & analytics into a hands-on about writing using R for performing analytics on a large dataset (using naive distributed computing) followed by a hands-on session on using RHadoop.

Requirements

  1. Familiarity with R would help though is not mandatory
  2. Some familiarity with statistics would help map jargon

Speaker bio

Anand Krishnaswamy is a developer at ThoughtWorks. His background spans filesystem development, storage management solution development, class library & compiler design & development & recently, web-app development. His interests in data analytics is self-developed. His other interests range from photography & cooking to writing & painting.

Comments

Login with Twitter or Google to leave a comment