The Fifth Elephant 2013

An Event on Big Data and Cloud Computing

Data Analysis and Visualization using R

Submitted by Vinayak Hegde (@vin) on Wednesday, 5 June 2013

Technical level

Intermediate

Section

Workshops

Status

Confirmed

Total votes:  +21

Objective

  1. To demonstrate intermediate-to-advanced usage of R
  2. To do a deep dive into different R packages using public data sets

This is a good primer for anyone using R to explore and analyze small (< 1 MB) or medium-sized (tens of MB to 1 GB) datasets.

Description

The workshop will use R to uncover patterns, anomalies and insights in public data sets. It will demonstrate how to use R for statistical analysis with different packages and techniques. A significant part of the workshop will also be dedicated to visually exploring patterns and anomalies in data using R packages such as ggplot2.

Requirements

  1. Some familiarity with R (how to start the interpreter, data structures, how to load packages)
  2. Installation of RStudio
  3. Basic knowledge of statistics (mean, median, distributions, etc.)

Speaker bio

Vinayak Hegde has worked with large-scale data and analytics for several years at MNCs such as InMobi and Akamai. He used R as part of the Marketplace team at InMobi to improve ad-serving relevance and performance.

His areas of expertise are data analytics and large-scale networks. He is a polyglot and often works with multiple languages to build robust and scalable software systems.
