The Fifth Elephant 2013

An Event on Big Data and Cloud Computing

Anand

@anandk

RHadoop: Marrying analytics & large scale data processing

Submitted Jun 2, 2013

  1. Why Hadoop is not analytics
  2. Why “Big Data” is not analytics
  3. Quick overview of performing analytics over a large data set (without RHadoop)
  4. Easing the pain with RHadoop: RHadoop tutorial session

Outline

This session will quickly segue from helping the audience realise the difference between “big data” & analytics into a hands-on about writing using R for performing analytics on a large dataset (using naive distributed computing) followed by a hands-on session on using RHadoop.

Requirements

  1. Familiarity with R would help though is not mandatory
  2. Some familiarity with statistics would help map jargon

Speaker bio

Anand Krishnaswamy is a developer at ThoughtWorks. His background spans filesystem development, storage management solution development, class library & compiler design & development & recently, web-app development. His interests in data analytics is self-developed. His other interests range from photography & cooking to writing & painting.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

Jump starting better data engineering and AI futures