The Fifth Elephant 2012

Finding the elephant in the data.

What are your users doing on your website or in your store? How do you turn the piles of data your organization generates into actionable information? Where do you get complementary data to make yours more comprehensive? What tech, and what techniques?

The Fifth Elephant is a two day conference on big data.

Early Geek tickets are available from

The proposal funnel below will enable you to submit a session and vote on proposed sessions. It is a good practice introduce yourself and share details about your work as well as the subject of your talk while proposing a session.

Each community member can vote for or against a talk. A vote from each member of the Editorial Panel is equivalent to two community votes. Both types of votes will be considered for final speaker selection.

It’s useful to keep a few guidelines in mind while submitting proposals:

  1. Describe how to use something that is available under a liberal open source license. Participants can use this without having to pay you anything.

  2. Tell a story of how you did something. If it involves commercial tools, please explain why they made sense.

  3. Buy a slot to pitch whatever commercial tool you are backing.

Speakers will get a free ticket to both days of the event. Proposers whose talks are not on the final schedule will be able to purchase tickets at the Early Geek price of Rs. 1800.

Hosted by

All about data science and machine learning

Onkar Hoysala


Structuring data from surveys - A case study from a slum survey conducted in 2010

Submitted May 8, 2012

To present a case study of the slum survey conducted by NGIL, CSTEP in 2010 and to discuss the data collection and cleanup process, and the challenges that were faced.


An extensive survey was carried out across 1000+ households in 36 slums across Bangalore by NGIL, CSTEP in 2010. We will talk about the process of data collection and cleanup and the challenges that we faced during the data collection and during cleanup. We will also give an overview of the database schema we designed for the survey.

Speaker bio

Aditi Murthy is a researcher at Fields of View, a non-profit organisation working on problems related to urban systems and public safety & security. She has two years experience in working with market simulations, game design and working with datasets. She was involved in the cleanup process of the slum survey data at NGIL and is involved in its analysis at Fields of View.

Onkar Hoysala is a researcher at Fields of View. He has 2 years research experience developing games and simulations for urban systems and disaster management. He was involved in the data cleanup and schema design for the slum database at NGIL and is trying to look at using the data to develop models at Fields of View.



{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

All about data science and machine learning