The Fifth Elephant 2013

An Event on Big Data and Cloud Computing

Vinayak Hegde

Vinayak Hegde

@vin

Data Analysis and Visualization using R

Submitted Jun 5, 2013

  1. To provide an intermediate-to-advanced usage of R
  2. To do a deep-dive into different R modules using public data sets

This is a good primer for someone who is using R for exploring and analyzing small (< 1MB) or medium-size (10s of MBs to 1 GB) datasets

Outline

The workshop will work on using R to uncover patterns, anomalies and insights in public data sets. The workshop will demonstrate on how to use R for statistics analysis using different modules and techniques. A significant part of the workshop will also be dedicated to visually exploring patterns and anomalies in data using R modules such as ggplot2.

Requirements

  1. Some familiarity with R (how to start the interpreter, data structures, how to load modules)
  2. Installation of R Studio
  3. Basic knowledge of statistics (mean, median, distributions) etc

Speaker bio

Vinayak Hegde has been working with large scale data and analytics for several years for MNCs such as Inmobi and Akamai. He used R as a part of Marketplace team in Inmobi to improve Ad-serving relevance and performance.

His areas of expertise are data analytics and large scale networks . He is a polyglot and often works with multiple languages to build robust and scalable software systems.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

Jump starting better data engineering and AI futures