Submissions

The Fifth Elephant 2012

Finding the elephant in the data.

What are your users doing on your website or in your store? How do you turn the piles of data your organization generates into actionable information? Where do you get complementary data to make yours more comprehensive? What tech, and what techniques?

The Fifth Elephant is a two day conference on big data.

Early Geek tickets are available from fifthelephant.doattend.com.

The proposal funnel below will enable you to submit a session and vote on proposed sessions. It is good practice to introduce yourself and share details about your work as well as the subject of your talk while proposing a session.

Each community member can vote for or against a talk. A vote from each member of the Editorial Panel is equivalent to two community votes. Both types of votes will be considered for final speaker selection.

It’s useful to keep a few guidelines in mind while submitting proposals:

  1. Describe how to use something that is available under a liberal open source license. Participants can use this without having to pay you anything.

  2. Tell a story of how you did something. If it involves commercial tools, please explain why they made sense.

  3. Buy a slot to pitch whatever commercial tool you are backing.

Speakers will get a free ticket to both days of the event. Proposers whose talks are not on the final schedule will be able to purchase tickets at the Early Geek price of Rs. 1800.

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conferences in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices.

PhanindraSarma Proposing

Hadoop and its Eco System

The objective is to understand the architecture and usage of Hadoop and the tools around it.
  • 5 comments
  • Submitted
  • 03 Oct 2011
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Akbar Ladak

Video Analytics for live streams

Gleaning data out of live streams - TV, radio, surveillance & sports.
  • 1 comment
  • Submitted
  • 10 Oct 2011
Section: Data Analytics Technical level: Beginner Session type: Demo

Vijay Rayapati

Lessons Learned : Scaling Hadoop and BigData in Cloud (Amazon EMR )

A high-level technology and business perspective on big data, including how and why to leverage cloud-based platforms like Amazon EMR along with MapReduce for data analysis.
  • 1 comment
  • Submitted
  • 20 Mar 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Anand S

Visualising Text

How can text be analysed quantitatively? How can it be visualised? What tools exist today that I can use?
  • 0 comments
  • Confirmed & scheduled
  • 29 Mar 2012
Section: Data Visualization Technical level: Intermediate Session type: Tutorial

Goldee Udani

Finding signal in the noise: How to put big data to use

How one can derive actionable insights from big data using Machine Learning, Natural Language Processing and Statistical Modeling.
  • 0 comments
  • Confirmed & scheduled
  • 29 Mar 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Uttam Grandhi

An Introduction to Data Visualization & Processing ( Programming Language)

To draw the audience into the world of Data Visualization through a gateway called Processing (the programming language).
  • 12 comments
  • Confirmed & scheduled
  • 23 Apr 2012
Section: Data Visualization Technical level: Beginner Session type: Tutorial

Deepak Shenoy

Financial Market and Economic Data

To demonstrate different ways to slice and dice financial market data and economic numbers. From static charts to moving and interactive visualizations, we’ll take a look at what markets have prompted investors, traders and managers to create in order to make sense of the madness.
  • 3 comments
  • Confirmed & scheduled
  • 23 Apr 2012
Section: Data Visualization Technical level: Intermediate Session type: Lecture

Sumandro C

the aRt of NSSO data

Understanding the structure of raw data published by the National Sample Survey Office. Understanding the processes (and libraries) for extracting, exploring and visualising the data with R.
  • 1 comment
  • Confirmed & scheduled
  • 25 Apr 2012
Section: Industry & Business Technical level: Intermediate Session type: Lecture

Jaidev Deshpande

Exploratory Data Analysis with Python

Objectives: Learning how to find general details about a dataset before jumping onto the machine learning / big data bandwagon. (I’m calling these things ‘bandwagon’ because they are incredibly powerful, and in many cases, the application might not warrant a full-scale use of such tools.)
  • 2 comments
  • Confirmed & scheduled
  • 25 Apr 2012
Section: Data Analytics Technical level: Intermediate Session type: Tutorial

Vinayak Hegde

A Herd of Elephants - Navigating the Hadoop Ecosystem

An overview of the Hadoop ecosystem and how the different parts of the ecosystem interact and fit together.
  • 1 comment
  • Confirmed & scheduled
  • 26 Apr 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Tutorial

Vinayak Hegde

Seeing through the Haze - Advanced Visualisations using Python

How to use advanced visualizations to see multidimensional data using Python libraries.
  • 1 comment
  • Confirmed & scheduled
  • 26 Apr 2012
Section: Data Visualization Technical level: Advanced Session type: Demo

Karthik Kastury

Real Time Analytics with Twitter Storm

Perform real-time analysis of the data coming into your system from multiple sources. I’ll talk about different use cases from a performance-monitoring standpoint, and a little about data visualization.
  • 4 comments
  • Submitted
  • 26 Apr 2012
Section: Data Analytics Technical level: Intermediate Session type: Tutorial

Amritayan Nayak

Visual data apps using d3.js and Knoema Data API

How to build beautiful and engaging HTML5 visualizations using d3.js and Knoema Data API.
  • 0 comments
  • Confirmed
  • 26 Apr 2012
Section: Data Visualization Technical level: Intermediate Session type: Tutorial

Purnachandra Rao Duggirala

How to select right charts for your data?

To teach beginners & regular users how to select the right charts & visualizations based on user needs.
  • 0 comments
  • Submitted
  • 27 Apr 2012
Section: Data Visualization Technical level: Beginner Session type: Lecture

Purnachandra Rao Duggirala

Visualization using MS Excel - Hands on Workshop

This is a half-day workshop to teach how to visualize data using Excel.
  • 0 comments
  • Submitted
  • 27 Apr 2012
Section: Data Visualization Technical level: Intermediate Session type: Workshop

kracekumar-twitter Proposing

Data analysis with Pydata

How to use pydata to harness data analytics.
  • 0 comments
  • Submitted
  • 28 Apr 2012
Section: Data Analytics Technical level: Beginner Session type: Tutorial

Milind Bhandarkar

Big Data Analytics with Greenplum Unified Analytics Platform

In this talk, attendees will learn various use cases of Big Data analytics, and how to solve them with Greenplum’s Unified Analytics Platform, which combines Greenplum Chorus, a collaboration platform for data science teams; Greenplum Database, a powerful MPP database; and Greenplum HD, a distribution of Apache Hadoop.
  • 1 comment
  • Submitted
  • 04 May 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

shree joshi

Introduction to data analysis & visualisation with R

To introduce the basic concepts & applications of R to the audience.
  • 5 comments
  • Confirmed & scheduled
  • 05 May 2012
Section: Data Analytics Technical level: Beginner Session type: Tutorial

Anoop Thomas Mathew

The Redis way of Analytics

Redis is one of the fastest in-memory NoSQL key-value databases available, and many companies, such as Disqus, use Redis for analytics. The workshop aims to enable attendees to prototype quickly and to design fast and efficient analytics solutions with Redis and Python. It will unveil the other side of Hadoop: instant analytics of data.
  • 0 comments
  • Submitted
  • 07 May 2012
Section: Data Analytics Technical level: Intermediate Session type: Workshop

Manoj Diwakar

Data Analysis using SAS Software.

The Statistical Analysis System (SAS 9.2) software is used extensively around the world in the medical and social industries. SAS can visualise large data sets and perform statistical analysis.
  • 0 comments
  • Submitted
  • 07 May 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Onkar Hoysala

Games for Data Collection

To discuss the use of serious games for data collection.
  • 0 comments
  • Submitted
  • 08 May 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Onkar Hoysala

Structuring data from surveys - A case study from a slum survey conducted in 2010

To present a case study of the slum survey conducted by NGIL, CSTEP in 2010 and to discuss the data collection and cleanup process, and the challenges that were faced.
  • 16 comments
  • Confirmed & scheduled
  • 08 May 2012
Section: Industry & Business Technical level: Intermediate Session type: Lecture

Sherline Pimenta K.

Everybody Needs a Story: A Narrative Approach to Information Visualization

Understanding Information Visualization via the Visual Narrative (VN) paradigm.
  • 0 comments
  • Submitted
  • 11 May 2012
Section: Data Visualization Technical level: Beginner Session type: Lecture

Lucy Chambers

OpenSpending: Data Literacy, Citizen Participation and Government Spending

Ever been curious about how your government spends your tax money? This session aims to show participants how they can use OpenSpending.org to explore public spending and hold governments to account. We’ll highlight examples of how others have used it, from tracking Big Brother states, our work in data journalism, myth-busting about government budgets in Hungary, showing the taxpayer how much an …
  • 0 comments
  • Submitted
  • 18 May 2012
Section: Data Visualization Technical level: Intermediate Session type: Workshop

Pramod Biligiri

Big Data in a Small Startup

What unexpected problems and challenges will you find when you go tackling big data problems, esp. if you’re a bootstrapped startup with resource constraints?
  • 1 comment
  • Submitted
  • 20 May 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Joydeep Sen Sarma

The Elephant in the Cloud

How do you build a big data service in the Cloud? How can we make queries against relatively slow Cloud Storage Systems fast? How can we take real advantage of the elasticity available in the Cloud? How do you make the Cloud dead easy to use for big data processing?
  • 2 comments
  • Confirmed & scheduled
  • 23 May 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Anannya Deb

From Data to Action (in Retail Banking)

The promise of data (and all the technologies, systems, processes, etc. related to it) for many years has been a world where people are empowered to lead successful lives. Customers get the right products and services, organisations become more profitable, governments become more responsive to their citizens, markets become more precise and predictable and so on and on.
  • 2 comments
  • Confirmed & scheduled
  • 24 May 2012
Section: Industry & Business Technical level: Intermediate Session type: Lecture

Amit Kapoor

Targeting the Moment of Truth: Using Big Data in Retail

To share insights from the use of big data by retailers in pricing, promotion & supply chain, and to understand how some of the leading retailers are using big data analytics to design the next best offer for their customers.
  • 0 comments
  • Confirmed & scheduled
  • 25 May 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Mohit Chawla

Logstash & Elasticsearch - Give meaning to your logs, and more

There is a lot of information available in your server/app logs. And a lot of noise, too. You can either treat all of this as a dry, lifeless source of information and use it only when troubleshooting/debugging, or you can do interesting things with your logs, make sense of them, and use them as an important data source to proactively drive decisions for your infrastructure/app.
  • 2 comments
  • Submitted
  • 27 May 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Venkata Ramana

Approaches to ML techniques on the Real world data.

Learn how to think about applying ML techniques, and the practical aspects of doing so.
  • 2 comments
  • Confirmed & scheduled
  • 28 May 2012
Section: Data Analytics Technical level: Intermediate Session type: Demo

Sameer Segal

CouchDB: All your JSON are belong to us

An overview of CouchDB, its amazing replication feature, its horizontal scalability and a completely new dimension to databases called CouchApps.
  • 0 comments
  • Submitted
  • 29 May 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Tutorial

Rahul Deshmukh

Splunk> Driving Real-time Analytics from Small Data, Big Data and Huge Data

Hear how more than half of Fortune 100 companies drive value from their data and how Splunk is helping organizations.
  • 0 comments
  • Confirmed & scheduled
  • 30 May 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Girish Narasimha Raghavan

Operational Data Analytics and Insights - Leveraging Bigdata to manage IT infrastructure.

The data gathered from monitoring IT infrastructure elements like hosts, networks, web servers, etc. is critical in ensuring smooth and reliable operation of the services provided. Storing, analyzing, and delivering actionable insights from such data becomes very difficult and complex when the infrastructure is set up at a large scale, as it is at Yahoo!.
  • 1 comment
  • Submitted
  • 31 May 2012
Section: Industry & Business Technical level: Intermediate Session type: Lecture

Dorai Thodla

Explorations in Data Analysis and Visualization

To share knowledge on how our goals led us to build a simple data analysis and visualization tool and some ideas for the future.
  • 0 comments
  • Confirmed & scheduled
  • 06 Jun 2012
Section: Data Visualization Technical level: Beginner Session type: Discussion

prashant singh

Managing Data on Hadoop

The paper describes an approach to managing high-volume data movement on Hadoop and making the data available for processing at Yahoo!. As part of grid data management, we load terabytes of data daily onto Hadoop clusters and replicate it to BCP clusters. In this tech talk, we want to share our experiences, challenges and techniques of high-volume data movement on HDFS.
  • 0 comments
  • Submitted
  • 06 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Shantanu Gudihal

Emerging techniques in stake holder feedback management

Explore the newer techniques in feedback management with the advancements in the field of text analytics.
  • 0 comments
  • Submitted
  • 06 Jun 2012
Section: Industry & Business Technical level: Intermediate Session type: Demo

Deepinder Dhingra

Real Time Intelligent Systems & Big Data Streams

To understand the latest trends in real-time analytics, and the advantages of applied predictive analytics.
  • 3 comments
  • Confirmed & scheduled
  • 06 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Sridhar Bollam

How your favorite retailers make money from analytics?

Challenges in the retail industry & ways to solve them through data mining & predictive analytics.
  • 1 comment
  • Confirmed & scheduled
  • 08 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

qubole

Messaging architecture at Facebook

The audience will learn design options for building a large-scale messaging system and how technical, philosophical and organizational considerations surrounding these played out at Facebook.
  • 1 comment
  • Confirmed & scheduled
  • 10 Jun 2012
Section: Industry & Business Technical level: Advanced Session type: Lecture

Prashanth Babu

An introduction to Data Analysis and Visualisation using D3.js

D3.js is a JavaScript library for manipulating documents based on data.
  • 0 comments
  • Submitted
  • 10 Jun 2012
Section: Data Visualization Technical level: Beginner Session type: Workshop

Prashanth Babu

Hands-on introduction to Pig

Pig is a high-level platform for creating MapReduce programs used with Hadoop for analyzing Big Data.
  • 0 comments
  • Submitted
  • 10 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Workshop

Gaurav Agarwal

The Elephant that Flew - Big Data Analytics @ InMobi

A discussion on the evolution of big data systems within InMobi and a discussion/demo of an in-house data processing and analytics system for large data at InMobi scale.
  • 0 comments
  • Confirmed & scheduled
  • 10 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Sunil Sayyaparaju

How big data moved the needle from monolithic SQL RDBMS to distributed NoSQL

This talk covers different types of DBMS solutions (SQL/NoSQL, monolithic/distributed) and the types of applications that suit each. It will highlight the design choices made by the developers of these different types of DBMS.
  • 1 comment
  • Submitted
  • 11 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Ramesh Hariharan

Your Genome on the Cloud: Big Data Challenges in Personalized Medicine

We are at the threshold of a major revolution in health care: thanks to two decades of explosive research in tools and techniques that interrogate living cells at the molecular level, doctors will soon have an invaluable tool added to their arsenal to help diagnose and cure disease, i.e., the genome of the patient. Several success stories have already emerged, for instance, a little boy who neede…
  • 1 comment
  • Confirmed & scheduled
  • 12 Jun 2012
Section: Data Analytics Technical level: Beginner Session type: Lecture

Harish

Big Data in Real Time: Processing the Social Web within a tolerable elapsed time

Understanding the challenges posed while dealing with ‘Big Data’, in terms of time, processing and storage, with special emphasis on social media intelligence systems which require crawling and processing terabytes of data within a ‘reasonable’ time frame.
  • 0 comments
  • Confirmed
  • 14 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Ashok Banerjee

Exponential Growth Models and Impact on Sales Forecast, Data Volume, Query Latency, Capacity Planning and Search Latency

We often talk loosely about exponential growth. In this talk we will delve into the mathematical models of when a domain or market will undergo exponential growth. We often mistakenly believe that one company’s execution is better than another’s, when in fact the domains and fundamental mathematical growth models of the two markets are different.
  • 0 comments
  • Confirmed & scheduled
  • 14 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Vibodh Parthasarathi

Same News, Different Channels? – Measuring Diversity in TV News

Present an approach to measure differences in Content, News Themes and Programming across TV News channels; Discuss ways to scale up this approach and more creatively showcase its findings.
  • 0 comments
  • Confirmed
  • 14 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Siddhartha Reddy

Build Your Own Search Engine

No, this is not another tutorial on using Solr/ElasticSearch/Sphinx/Lucene. Imagine that none of these existed and you need a search engine for your shiny new eCommerce startup. What would you do? Build your own search engine, of course. I’ll go over what it would take to do that.
  • 0 comments
  • Confirmed & scheduled
  • 15 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Ashwin Rajeev

Building analyzing and visualizing large graphs

A tutorial on how to model, use and visualize large graphs on existing storage systems.
  • 4 comments
  • Confirmed & scheduled
  • 15 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Shourya Sarcar

Data Explosion in Medical Imaging

Gone, or rather going, are the days when you would exit a doctor’s chamber with an X-ray plate in your hand. Medical imaging has turned filmless today. With the advent of CT, MR, and digital X-ray machines churning out gigabytes of data for each patient in every tier 1 & 2 Indian city, are we prepared to archive, transfer and retrieve this data?
  • 1 comment
  • Confirmed & scheduled
  • 15 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Shashwat Agarwal

Scaling Data ( DB ->Caching -> Archiving -> Sharding and NoSQL)

In this talk I will go over the stages of scaling for OLTP data processing. Each level of scaling is a step-function increase in effort and should be deferred, within reason, in a startup ecosystem.
  • 0 comments
  • Submitted
  • 16 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Ashish Prasad

Finding the elephant with Neo4j

An introduction to using a graph database called Neo4j and how one can organize data in graphs to mine metrics that matter.
  • 2 comments
  • Submitted
  • 18 Jun 2012
Section: Data Analytics Technical level: Beginner Session type: Tutorial

Arvind Saraf

Application of Data analytics in Small / Medium Enterprise planning

Understand the relevance of better use of data in SME planning, and think of various architectures / solutions that can fit the problem.
  • 0 comments
  • Confirmed & scheduled
  • 19 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Laura Newman

Become a Data Wrangler! Learning how to work with data.

Discover more about the School of Data. Try out some learning challenges to hone your skills, or turn your hand to writing and share your knowledge with Data Wranglers across the world!
  • 1 comment
  • Submitted
  • 19 Jun 2012
Section: Data Analytics Technical level: Beginner Session type: Workshop

Anand Chitipothu

How the Internet Archive preserves petabytes of data

Using the Internet Archive as a case study, this talk presents aspects of big data in the context of long-term preservation.
  • 0 comments
  • Confirmed & scheduled
  • 20 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Supreet Sethi

Start with step 0 -> Approaching big data for small enterprises

Big data promises business gains. But a small or medium enterprise finds itself in a dilemma about where to start. This session provides refreshingly simple steps from the familiar territory of Excel sheets and database tables to the high-rolling game of big data.
  • 0 comments
  • Submitted
  • 21 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Tutorial

Prasad A N

Analytics Platform - Experience and evolution

With the growth of video on the web & the explosion of social networks, analytics around media consumption has become a prime field of late. In this talk, we explore Akamai’s offering for analytics, the media context, the needs thereof, the technology & its evolution. Not to mention the challenges of doing it at internet scale!
  • 0 comments
  • Confirmed & scheduled
  • 22 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Discussion

Ramesh Perumalsamy

Build your own Real Time Analytics and Visualization, Enable Complex Event Processing, Event Patterns and Aggregates

This talk is not about usage of existing tools or frameworks. This is about building a platform for Real Time Analytics, Real time visualization and Real time Complex Event Processing triggered by Events, Event Patterns and Event Aggregates.
  • 0 comments
  • Confirmed & scheduled
  • 22 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Demo

Gaurav Bhalotia

Recommendation Play @Flipkart

We will go over internet recommendation systems and how the user data can be mined to learn about preferences, tastes and behavior. The goal is to give users a flavor of building intelligent systems at scale.
  • 0 comments
  • Confirmed & scheduled
  • 24 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Vivek Mehta

Survey of Machine Learning tools as the data grows (Weka, R, Octave, Mahout)

To review the available machine learning tools relevant to different sizes and scales of data.
  • 2 comments
  • Submitted
  • 25 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Karthik Shashidhar

The Art of Data Analysis

The explosion in techniques for data and statistical modeling has not really led to the explosion in data-driven insights that one would have expected.
  • 0 comments
  • Submitted
  • 25 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Workshop

Avik Sarkar

Combining Social Media Insights with Statistical Insights for Improved Analytics

Various facts are discussed about organizations and brands on social media. How can organizations incorporate findings from social media to improve analytics processes such as identifying customer churn or finding users to target in a new marketing campaign? This can help organizations take proactive action in real time to retain customers or improve customer satisfaction. This 360 degrees view …
  • 1 comment
  • Submitted
  • 25 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Lucy Chambers

The Data Journalism Handbook

What is Data Journalism? How can I get started? Can anyone do it? What is the Data Journalism scene like in India? This session will tackle questions such as these and will show you that anyone can be a data journalist (and that it is a jolly exciting thing to be!).
  • 0 comments
  • Confirmed & scheduled
  • 27 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Discussion

Prabhu Ramachandran

Lessons from the evolution of Mayavi

Learn about a successful 3D data visualization package and the story of how and why it grew the way it did.
  • 0 comments
  • Confirmed & scheduled
  • 28 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Santosh Gannavarapu

Dianemo: Distributed task management to achieve faster throughput

There is always a need to run batch/offline jobs while utilizing resources efficiently. The initial attempts are to set aside pre-configured worker boxes that will pick up jobs either by a cron scheduler or by a pre-designated manager application that is responsible for kicking off processes. Assigning boxes statically would necessitate deploying boxes that may either run out of bandwidth or may …
  • 0 comments
  • Submitted
  • 28 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Kaushik Paranjape

Sharding MySQL

Learn how a MySQL database can be scaled to meet your requirements.
  • 0 comments
  • Submitted
  • 28 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Prashant Kumar

Building the Infrastructure to Handle Big Data

Understanding the pieces involved in building a generic Big Data infrastructure.
  • 3 comments
  • Confirmed & scheduled
  • 28 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Prashant Kumar

Riak: Robust and featureful distributed Key-Value store

In this talk I’ll try to bring out the unique features of Riak compared to other NoSQL databases, in particular its rich API, high availability and its use as a core building block for big data.
  • 0 comments
  • Submitted
  • 28 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Bibhash Chakrabarty

Identifying your influencers in social media is not rocket science

Help initiate infrastructure & algorithm building for an influencer identification and engagement program on social media.
  • 0 comments
  • Submitted
  • 28 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Sharad Agarwal

Big Data : The wave of Innovation and Disruption

To understand the ‘big’ picture and how the intersection of technology innovation, upcoming verticals and business landscape is driving the growth of the Big Data ecosystem.
  • 0 comments
  • Submitted
  • 28 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Karthikeyan Damodaran

Analytics is a magic wand for HR professionals

An overview of Workforce Analytics as a competency, and how some very persistent and intractable HR challenges can be overcome through HR analytics. Examples: maximizing retention, compensation & rewards optimization, boosting hiring effectiveness, etc.
  • 1 comment
  • Confirmed & scheduled
  • 29 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Biswajit Pal

“I know what you are going to do next summer” – Predicting Repeat Purchase Behavior by using Bayesian Hierarchical model and Regression Techniques

A major question in database marketing is that of identifying the customers who are most likely to make a repeat purchase in the near future. We will discuss a scalable repeat purchase scoring algorithm that assigns each customer a propensity score, i.e. the probability of transacting in the next ‘n’ periods of time. This algorithm presently scores an entire database containing around 100MM customers in co…
  • 1 comment
  • Confirmed & scheduled
  • 29 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Kushwaha Manish Kaushal

Handle BigData Analytics with Hadoop eco-system (Hadoop, HBase, Hive, WorkFlow)

To give insights into the problems and solutions when working with a very high volume of data (~330 TB). The problems involve hardware infrastructure and functional treatment, and they grow further if your data collection is increasing by 10% per month. The solution presented uses the Hadoop ecosystem.
  • 0 comments
  • Submitted
  • 29 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Piyush Goel

Making sense out of semi-structured log data

Discuss how to use big-data systems for analyzing log data in real time.
  • 0 comments
  • Submitted
  • 29 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Puneeth Chaganti

Data Visualization with Python

By the end of this workshop, participants will be able to …
  • 0 comments
  • Submitted
  • 29 Jun 2012
Section: Data Visualization Technical level: Intermediate Session type: Workshop

pranavbhasin

How to balance insights from analytics and your intuition to drive product decisions

By the end of this session, the audience will have a good framework for making data-driven decisions while still using their intuition as a guide. more
  • 0 comments
  • Submitted
  • 29 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Anindya Neogi

"Honey, we blew the data - to build a Market Mix Model" : Modeling the Market Mix under unfavourable data conditions

The term Market mix modeling is widely used and applied indiscriminately to a broad range of marketing models used to evaluate different components of marketing plans, such as advertising, promotion, packaging, media weight levels, sales force numbers, etc. These models can be of many types, but multiple regression is the workhorse of most marketing mix modeling. Regression is based on a number o… more
  • 0 comments
  • Submitted
  • 30 Jun 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Gaurav Mishra

Making sense of UGC on social networks

Social media generates a massive amount of data, and not all brands are making enough sense of it. Given a platform full of conversations to listen to, the question is how to build an understanding from them and use the data to identify consumers and prospects. more
  • 0 comments
  • Submitted
  • 30 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Gitali Halder Proposing

Mobile Portal Personalization using Real-Time Prediction Analytics in order to help Communication Service Providers (CSPs) in customized offerings to users

With the phenomenal growth of mobile multimedia applications across various device vendor-controlled App Stores, Communication Service Providers (CSPs) are immensely challenged to identify new revenue streams due to the gradual cannibalization of their voice and messaging revenues by next generation social media players from the Internet community. Our aim is to create a real-time solution to enable… more
  • 0 comments
  • Submitted
  • 30 Jun 2012
Section: Big Data Infrastructure & Processing Technical level: Advanced Session type: Lecture

Gora Mohanty

Fieldata: Transparency and analytics for non-profits

Fieldata ( http://fieldata.org ) is an initiative aimed at promoting data-driven decision-making in the non-profit sector in India; including NGOs, governmental organisations, and academic institutions. more
  • 0 comments
  • Submitted
  • 30 Jun 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Nikhil Pahwa

RTI & Data: Opportunities, Issues & Challenges

You’ll understand what kind of data you can get from the government, how you can use this data, and the challenges faced when asking for information and when working with the data you receive. more
  • 0 comments
  • Confirmed & scheduled
  • 01 Jul 2012
Section: Industry & Business Technical level: Beginner Session type: Discussion

apoorv durga

Social Media Monitoring - what is it and why do you need to know about it?

This fast-paced presentation will share customer research from noted evaluation firm Real Story Group on leading Social Media Monitoring and Intelligence (SMM/I) tools, and provide a framework for customers to assess technology choices based on their particular needs. Specifically, a methodology for mapping business needs to technology alternatives, as well as a roadmap for evaluating SMM/I vendo… more
  • 1 comment
  • Submitted
  • 01 Jul 2012
Section: Industry & Business Technical level: Beginner Session type: Tutorial

Kalpana Krishnaswami

Open data and the life sciences: An open world

An example of building real world semantic web applications for the life sciences. Challenges, perspectives and our experiences. more
  • 0 comments
  • Confirmed & scheduled
  • 02 Jul 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Ankur Pandey

The science behind Predictive Analytics- a Text Mining Perspective

I will introduce key concepts and algorithms from Statistics, Machine Learning, Information Retrieval, etc. which are being used in the text mining side of the predictive analytics domain. more
  • 1 comment
  • Confirmed & scheduled
  • 02 Jul 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Arvind K

Towards Social Semantic Tools

People on social media sites like Twitter, Facebook, etc. have been generating a lot of data for some time now, and there have been attempts to analyse the sentiment of the moment, for example to guess the trend of some stock price. However, for most of us this data is of interest if we can assimilate something of interest and present it in a different context. We present a simple set of tools,… more
  • 0 comments
  • Confirmed & scheduled
  • 02 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Demo

Prithvijit roy

Simplifying advanced analytics to empower business managers

The intent of the session is to provide a comprehensive picture of the key challenges analytics is facing in the world of business and how we can solve them with the help of technology and business context. more
  • 0 comments
  • Confirmed & scheduled
  • 03 Jul 2012
Section: Industry & Business Technical level: Beginner Session type: Lecture

Karthik Visweswariah

Building Watson -- A Brief Overview of DeepQA and the Jeopardy! Challenge

We will give an overview of the building of Watson, a computer system that was able to defeat human grand champions in the game of Jeopardy! (a popular quiz show in the United States). more
  • 0 comments
  • Confirmed & scheduled
  • 04 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Sandeep Amar

Everything you wanted to know about internet analytics - to infinity and beyond.....

This session will give a complete overview of internet data and analytics: more
  • 0 comments
  • Submitted
  • 05 Jul 2012
Section: Data Analytics Technical level: Advanced Session type: Lecture

rhebbar

Social Media & Text Analytics using Open Source Tools

Discuss how you can start analyzing social media data today using open source tools like RapidMiner or open source languages like Python, R and Perl. more
  • 1 comment
  • Confirmed & scheduled
  • 05 Jul 2012
Section: Data Analytics Technical level: Beginner Session type: Lecture

Joseph Ziegler

Big Data and the Cloud a Best Friend Story

Communicate the challenges of big data processing that the cloud addresses, such as public data sets, sharing, collection, storage and analytics. more
  • 1 comment
  • Confirmed & scheduled
  • 06 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Regunath Balasubramanian

Aadhaar - world's largest biometric identity platform (200 trillion biometric matches per day, 2 PB of data)

Describe the technology needs and solutions behind Aadhaar - the world’s largest biometric identity platform. more
  • 3 comments
  • Confirmed & scheduled
  • 06 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

ahmar abdullah

Delivering profitable growth for eCommerce business using advance analytics

The session will give an understanding of how advanced analytics can be used to drive profitable growth in an eCommerce business. Various problems that exist along the eCommerce value chain are identified, and analytics solutions for them are discussed. more
  • 0 comments
  • Submitted
  • 06 Jul 2012
Section: Data Analytics Technical level: Intermediate Session type: Lecture

Chandu Nair

Making money out of data

To help folks understand the ways money can be made out of data in its various forms more
  • 0 comments
  • Submitted
  • 06 Jul 2012
Section: Industry & Business Technical level: Intermediate Session type: Discussion

Navjot Sidhu

Looking beyond the usual suspects for Near Real Time analytics on Big Data

Share lessons learned that help look beyond the hype of big data. Some insights on infrastructure, limitations and capabilities available. more
  • 0 comments
  • Confirmed & scheduled
  • 09 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Lecture

Anand Shankar

A complete suite setup and challenges on modern database

A complete suite setup and challenges on modern database and hands-on. more
  • 1 comment
  • Submitted
  • 09 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Workshop

Rahul Kulkarni

Crunching big data, Google scale

Insights from Google’s experience on handling big data, overview of techniques and products (with case studies) on crunching big data. more
  • 0 comments
  • Confirmed & scheduled
  • 09 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Lecture

Venky Shankar

GlusterFS "Big Data" Interface

Infrastructure for Big-Data processing (drop-in replacement for Hadoop Distributed File System - HDFS) more
  • 0 comments
  • Confirmed & scheduled
  • 17 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Intermediate Session type: Demo

Harish Pillay

In Data We Believe

Data drives everything we do. Without it, everything stops. Literally. This swamp of data we are both floating and sinking in is not really new. What is new is that enormous amounts of data are now at our fingertips. It is not stored away and at rest. We are at the dawn of a golden period of gaining insights from this data that would not have been possible even five years ago. This talk will take … more
  • 0 comments
  • Confirmed & scheduled
  • 18 Jul 2012
Section: Big Data Infrastructure & Processing Technical level: Beginner Session type: Discussion
