The Fifth Elephant 2012

Finding the elephant in the data.

What are your users doing on your website or in your store? How do you turn the piles of data your organization generates into actionable information? Where do you get complementary data to make yours more comprehensive? What tech, and what techniques?

The Fifth Elephant is a two day conference on big data.

Early Geek tickets are available from fifthelephant.doattend.com.

The proposal funnel below will enable you to submit a session and vote on proposed sessions. It is a good practice introduce yourself and share details about your work as well as the subject of your talk while proposing a session.

Each community member can vote for or against a talk. A vote from each member of the Editorial Panel is equivalent to two community votes. Both types of votes will be considered for final speaker selection.

It’s useful to keep a few guidelines in mind while submitting proposals:

  1. Describe how to use something that is available under a liberal open source license. Participants can use this without having to pay you anything.

  2. Tell a story of how you did something. If it involves commercial tools, please explain why they made sense.

  3. Buy a slot to pitch whatever commercial tool you are backing.

Speakers will get a free ticket to both days of the event. Proposers whose talks are not on the final schedule will be able to purchase tickets at the Early Geek price of Rs. 1800.

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Gaurav Agarwal

@calven321

The Elephant that Flew - Big Data Analytics @ InMobi

Submitted Jun 10, 2012

A discussion on the evolution of bigdata systems within InMobi and a discussion/demo of an in-house data processing and analytics system for large data at InMobi scale.

Outline

Data processing, analysis and visualization of data at a scale at which web systems (like InMobi) operate, is a very hard problem. I’ll discuss the evolution of the data analytics at InMobi - the approaches that we tried, the challenges faced, and our rationale to develop an in-house analytics system on top of Hadoop. I will be discussing the details of this system (moderate level), how it is helping us attain better efficiency levels and what future directions could it take. I will also be doing a demo of the capabilities of this system towards the end of the talk.

Requirements

Basic familiarity with the concepts of data analytics and Hadoop.

Speaker bio

Gaurav Agarwal currently leads the Data Analytics system at InMobi. He has 5+ years of experience building analytics for extremely large amounts of data at InMobi and, previously, at Google. He holds a Masters degree in Computer Science from University of California.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more