The Fifth Elephant 2013

An Event on Big Data and Cloud Computing

Interactive analysis of data live, using Pandas, Matplotlib and IPython

Submitted by Lakshman Prasad (@becomingguru) on Wednesday, 1 May 2013

videocam_off

Technical level

Beginner

Section

Analytics and Visualization

Status

Confirmed

Vote on this proposal

Login to vote

Total votes:  +26

Objective

The session is a live coding session to analyse various datasets using Pandas and plotting them live, in an IPython notebook.

There has been a surge in the development of SciPy tools and it's adoption has seen an unprecedented increase recently because it can be used for both interactive analysis and run in production.

The session hopes to give the audience a short tour of the data analysis and visualisation using these scientific python tools by doing some analysis live on different data sets.

Description

One of the data sets that is going to be the used is the dataset parsed from usesthis.com: The hardware and software used by people to get their work done. (permission for the same from the site owner has been obtained.)

The audience is going to be a part of the whole process of parsing it and converting it into numpy arrays whereupon it can be analysed to find various answers.

Another dataset would be the names of people in the US social security database since 1880 with 3 million published name records.

Speaker bio

The speaker has been working on Python for years and SciPy tools have always interested him. Recently he took the time to dive into it and has been pleased with what he learnt so far, which he can't wait to share!

Comments

  • 2
    Matthias Bussonnier (@Mbussonn) 5 years ago

    IPython, not iPython. We do not want to be sued by the iFruit brand :-)

    • 2
      Lakshman Prasad (@becomingguru) Proposer 5 years ago (edited 5 years ago)

      You are right. I corrected it now. iBad!

  • 1
    SriSreedhar (@srisreedhar) 5 years ago

    Awesome, iam Waiting !!

Login with Twitter or Google to leave a comment