by The Fifth Elephant

The Fifth Elephant 2014

A conference on big data and analytics

The Fifth Elephant 2014

A conference on big data and analytics

by The Fifth Elephant
date_range

Date

23–26 Jul 2014

place

Venue

NIMHANS Convention Centre

About

In 2014, infrastructure components such as Hadoop, Berkeley Data Stack and other commercial tools have stabilized and are thriving. The challenges have moved higher up the stack from data collection and storage to data analysis and its presentation to users. The focus for this year’s conference on analytics – the infrastructure that powers analytics and how analytics is done.

Talks will cover various forms of analytics including real-time and opportunity analytics, and technologies and models used for analyzing data.

Proposals will be reviewed using 5 criteria:
Domain diversity – proposals will be selected from different domains – medical, insurance, banking, online transactions, retail. If there is more than one proposal from a domain, the one which meets the editorial criteria will be chosen.
Novelty – what has been done beyond the obvious. Insights – what insights does the proposal share with the audience that they did not know earlier. Practical versus theoretical – we are looking for applied knowledge. If the proposal covers material that can be looked up online, it will not be considered.
Conceptual versus tools-centric – tell us why, not how. Tell the audience what was the philosophy underlying your use of an application, not how an application was used. Presentation skills – proposer’s presentation skills will be reviewed carefully and assistance provided to ensure that the material is communicated in the most precise and effective manner to the audience.

Tickets: http://fifthel.doattend.com

Website: https://fifthelephant.in/2014

For queries about proposals / submissions, write to info@hasgeek.com

Theme

  1. Data Collection and Transport – for e.g, Opendatatoolkit, Scribe, Kafka, RabbitMQ, etc.

  2. Data Storage, Caching and Management – Distributed storage (such as Gluster, HDFS) or hardware-specific (such as SSD or memory) or databases (Postgresql, MySQL, Infobright) or caching/storage (Memcache, Cassandra, Redis, etc).

  3. Data Processing, Querying and Analysis – Oozie, Azkaban, scikit-learn, Mahout, Impala, Hive, Tez, etc.

  4. Real-time analytics

  5. Opportunity analytics

  6. Big data and security

  7. Big data and internet of things

  8. Data Usage and BI (Business Intelligence) in different sectors.

Please note: the technology stacks mentioned above indicate latest technologies that will be of interest to the community. Talks should not be on the technologies per se, but how these have been used and implemented in various sectors, enterprises and contexts.

Venue

Dairy Circle

Bangalore, , IN

All proposals

Confirmed sessions

Large Scale Modelling and Analytics Challenges at a Payments Company

subhajit sanyal (@subhajit)

  • Full talk
  • Intermediate
  • 1 upvotes
  • 0 comments
  • Fri, 4 Jul

Real Time User-Scoring for Bidding in Display Retargeting

Ambuj Singh (@ambujs)

  • Crisp talk
  • Beginner
  • 1 upvotes
  • 0 comments
  • Thu, 3 Jul

Getting your hands dirty with Aerospike

Sunil Sayyaparaju (@sunils)

  • Sponsored workshop
  • Intermediate
  • 4 upvotes
  • 0 comments
  • Tue, 1 Jul

Using Data for Art

Rasagy Sharma (@rasagy)

  • Crisp talk
  • Beginner
  • 24 upvotes
  • 0 comments
  • Sun, 15 Jun

Scaling SolrCloud to a large number of collections

Shalin Mangar (@shalinmangar)

  • Full talk
  • Advanced
  • 7 upvotes
  • 0 comments
  • Sun, 15 Jun

Big data in finance

Chirag Anand (@chiraganand)

  • Full talk
  • Intermediate
  • 8 upvotes
  • 1 comments
  • Fri, 13 Jun
  • slideshow

Dr. Hadoop – Diagnose your Hadoop Jobs

Chandraprakash Bhagtani (@cpbhagtani)

  • Crisp talk
  • Intermediate
  • 15 upvotes
  • 4 comments
  • Fri, 13 Jun

De-dup on Hadoop

Neeta Pande (@neetapande)

  • Crisp talk
  • Beginner
  • 4 upvotes
  • 0 comments
  • Thu, 12 Jun
  • slideshow

Real world machine learning

Harshad Saykhedkar (@harshss)

  • Workshops
  • Intermediate
  • 6 upvotes
  • 0 comments
  • Wed, 11 Jun

The state of Julia - a fast language for technical computing

Viral B. Shah (@viralbshah)

  • Crisp talk
  • Intermediate
  • 9 upvotes
  • 0 comments
  • Wed, 11 Jun

Lessons from Elasticsearch in production

Swaroop (@swaroopch)

  • Full talk
  • Intermediate
  • 32 upvotes
  • 1 comments
  • Wed, 11 Jun

Data sciences (is) in fashion @ Myntra

Divya Alok (@divyaalok)

  • Full talk
  • Intermediate
  • 8 upvotes
  • 2 comments
  • Tue, 10 Jun

Analytics on Large Scale, Unstructured, Dynamic Data using Lambda Architecture

Rajesh Muppalla (@codingnirvana)

  • Full talk
  • Intermediate
  • 16 upvotes
  • 4 comments
  • Mon, 9 Jun

Using Cascalog and Clojure to make the elephant move!

Harshad Saykhedkar (@harshss)

  • Crisp talk
  • Intermediate
  • 3 upvotes
  • 1 comments
  • Sun, 8 Jun

The ART of Data Mining - Practical Learnings from Real-world Data Mining applications

Shailesh Kumar (@shkumar)

  • Full talk
  • Intermediate
  • 18 upvotes
  • 8 comments
  • Tue, 3 Jun

Scaling Spatial Data - OpenStreetMap as Infrastructure.

Sajjad Anwar (@geohacker)

  • Full talk
  • Intermediate
  • 13 upvotes
  • 0 comments
  • Tue, 3 Jun

Machine Learning using R : Crash course in Classification Methods

Bargava Subramanian (@barsubra)

  • Workshops
  • Beginner
  • 8 upvotes
  • 2 comments
  • Sun, 1 Jun

Machine learning at scale with Spark

madhukara phatak (@madhukaraphatak)

  • Full talk
  • Beginner
  • 5 upvotes
  • 0 comments
  • Sat, 31 May

Live analytical dashboards at scale - SQL style

Shashwat Agarwal (@shashwatag)

  • Full talk
  • Intermediate
  • 10 upvotes
  • 7 comments
  • Mon, 26 May

Apache Tez: Accelerating Hadoop Data Pipelines

t3rmin4t0r (@t3rmin4t0r)

  • Full talk
  • Beginner
  • 13 upvotes
  • 4 comments
  • Fri, 23 May

How to build a Data Stack from scratch

Vinayak Hegde (@vin)

  • Full talk
  • Intermediate
  • 32 upvotes
  • 1 comments
  • Wed, 21 May
  • slideshow

Scaling real time visualisations for Elections 2014

Anand S (@sanand0)

  • Full talk
  • Intermediate
  • 18 upvotes
  • 0 comments
  • Mon, 19 May

Experimentation to Productization : developing a Dynamic Bidding system for a location aware Mobile landscape

Ekta Grover (@ekta1007)

  • Full talk
  • Intermediate
  • 20 upvotes
  • 0 comments
  • Mon, 12 May
  • slideshow

Unified analytics platform for Bigdata

Amareshwari Sriramadasu (@amareshwari)

  • Full talk
  • Intermediate
  • 12 upvotes
  • 3 comments
  • Mon, 12 May

Storing relationships in large data-sets using Graphs

Inder Singh (@indersingh)

  • Crisp talk
  • Advanced
  • 9 upvotes
  • 3 comments
  • Sun, 11 May
  • slideshow

Realizing Large-scale Distributed Deep Learning Networks over GraphLab

Dr. Vijay Srinivas A (@avijaysrinivas)

  • Full talk
  • Intermediate
  • 19 upvotes
  • 1 comments
  • Wed, 7 May

Building distributed search applications using Apache SOLR

Saumitra Srivastav (@saumitra)

  • Workshops
  • Beginner
  • 8 upvotes
  • 6 comments
  • Mon, 28 Apr
  • play_arrow
  • slideshow

Why we built the most adopted Polyglot Object Mapper for NoSQL?

Vivek Shrivastava (@vishri)

  • Full talk
  • Intermediate
  • 27 upvotes
  • 1 comments
  • Fri, 25 Apr

'Know Your Customer!' - Advanced Data Science for Audience Segmentation

prabhakar srinivasan (@prabhacar7)

  • Full talk
  • Advanced
  • 17 upvotes
  • 3 comments
  • Mon, 21 Apr

Crafting Visual Stories with Data

Amit Kapoor (@amitkaps)

  • Full talk
  • Beginner
  • 10 upvotes
  • 0 comments
  • Fri, 28 Mar
  • slideshow

Circuitscape - A Case Study on Scientific Computing

Viral B. Shah (@viralbshah) via Tanmay K. Mohapatra (@tanmaykm)

  • Full talk
  • Intermediate
  • 10 upvotes
  • 0 comments
  • Mon, 3 Mar

Serving user intent : Facebook style notifications using HBase and Event streams

Regunath Balasubramanian (@regunathb)

  • Full talk
  • Intermediate
  • 23 upvotes
  • 2 comments
  • Fri, 31 Jan

Unconfirmed proposals

How to deploy a 50 node SolrCloud cluster on AWS in 15 minutes

Shalin Mangar (@shalinmangar)

  • Crisp talk
  • Beginner
  • 6 upvotes
  • 0 comments
  • Sun, 15 Jun

Overcoming problems that you will face when trying to break speed limit

Sunil Sayyaparaju (@sunils)

  • Full talk
  • Intermediate
  • 16 upvotes
  • 1 comments
  • Sun, 15 Jun

Supercharge Application I/O Performance with SSD caching

Sumit Kumar (@sumitk)

  • Full talk
  • Intermediate
  • 15 upvotes
  • 0 comments
  • Sun, 15 Jun

big data analytics with machine learning

Swapnil Birla (@swapnilbirla)

  • Crisp talk
  • Beginner
  • 2 upvotes
  • 0 comments
  • Thu, 12 Jun

Interactive analytics on event streams with complexly nested schemas

Abishek Baskaran (@abishekbaskaran)

  • Full talk
  • Intermediate
  • 47 upvotes
  • 0 comments
  • Thu, 12 Jun

Twitter data collection framework for dummies.

Nischal HP (@nischalhp)

  • Full talk
  • Beginner
  • 9 upvotes
  • 0 comments
  • Wed, 11 Jun

Latest trends in Market Mix Modeling & a unique way of making measurement & optimization more effective

rhebbar (@rhebbar) (proposing)

  • Crisp talk
  • Advanced
  • 17 upvotes
  • 1 comments
  • Mon, 9 Jun
  • slideshow

Ten things to consider for Interactive Analytics on high volume, write-once workloads

Abinasha Karana (@abhinashak)

  • Full talk
  • Advanced
  • 5 upvotes
  • 0 comments
  • Mon, 9 Jun
  • slideshow

Filtering the noise from an avalanche of Google Analytics Metrics : Anomaly Detection

Kushan Shah (@shahkushan17)

  • Crisp talk
  • Intermediate
  • 4 upvotes
  • 0 comments
  • Sat, 7 Jun

Real Time Secure API delivering data @ scale

Akash Mishra

  • Crisp talk
  • Beginner
  • 4 upvotes
  • 1 comments
  • Wed, 4 Jun

Migrating traditional warehouse and its applications to a Big-data platform

Manish Shukla (@manishshukla)

  • Full talk
  • Intermediate
  • 6 upvotes
  • 0 comments
  • Wed, 4 Jun

Fast Elephant - the Cheeliphant (Cheetah-Elephant)!

Ashok Banerjee (@ashokbanerjee)

  • Full talk
  • Beginner
  • 24 upvotes
  • 0 comments
  • Tue, 3 Jun

Run Predictive Machine Learning algorithms on Hadoop without even knowing Mapreduce.

GaganDeep Juneja (@gagandeepjuneja)

  • Full talk
  • Intermediate
  • 6 upvotes
  • 1 comments
  • Tue, 3 Jun

Advanced Big Data Analytics using Apache Mahout and Giraph

swapnil dubey (@swapnildubey1984)

  • Workshops
  • Advanced
  • 25 upvotes
  • 6 comments
  • Mon, 2 Jun
  • slideshow

Machine learning + Interactive visualization: A pragmatic approach to fixing knowledge bases

Viraj Paripatyadar (@virajparipatyadar)

  • Full talk
  • Beginner
  • 2 upvotes
  • 0 comments
  • Sun, 1 Jun

Tailor made stores at myntra or how to personalize your search results

Apoorva Gaurav (@apoorvagaurav)

  • Crisp talk
  • Intermediate
  • 7 upvotes
  • 2 comments
  • Sat, 31 May

Lambda Architecture

Nitin Supekar (@nsupekar)

  • Full talk
  • Intermediate
  • 3 upvotes
  • 1 comments
  • Fri, 23 May

De-dup @ Scale : Experiments with DynamoDB

Hemanth Yamijala (@yhemanth)

  • Full talk
  • Intermediate
  • 27 upvotes
  • 3 comments
  • Thu, 22 May

Hive and Presto for Big Data Analytics in the Cloud

Vikram Agrawal (@vikram)

  • Full talk
  • Intermediate
  • 19 upvotes
  • 2 comments
  • Tue, 20 May
  • slideshow

Using Elasticsearch for Analytics

Vaidik Kapoor (@vaidik)

  • Full talk
  • Intermediate
  • 13 upvotes
  • 4 comments
  • Sun, 18 May

Extracting and Employing Domain-Specific Knowledge Graphs (DKGraphs)

Satnam Singh, PhD (@satnam-datageek)

  • Full talk
  • Beginner
  • 9 upvotes
  • 0 comments
  • Tue, 13 May

Extending Vega - A visualisation grammar to create interactive visualisations

anupamme (@anupamme)

  • Crisp talk
  • Beginner
  • 6 upvotes
  • 4 comments
  • Sat, 3 May

Spot the model hiding in the Big Data

Ashok Banerjee (@ashokbanerjee)

  • Full talk
  • Beginner
  • 34 upvotes
  • 7 comments
  • Wed, 30 Apr

Apache Pig Power tools

visuthemoon (@vissuthedatascientist)

  • Workshops
  • Intermediate
  • 20 upvotes
  • 8 comments
  • Mon, 28 Apr
  • slideshow

BDAS, the Berkeley Data Analytics Stack

Mukesh Gangadhar (@mukgbv)

  • Crisp talk
  • Beginner
  • 4 upvotes
  • 0 comments
  • Tue, 15 Apr

What would you recommend?

Anand (@anandk)

  • Workshops
  • Intermediate
  • 5 upvotes
  • 1 comments
  • Fri, 11 Apr

Curating A Hunderd Thousand Online Stores Using Storm, ElasticSearch and Etcd

Suman Karthik (@mrphoebs)

  • Full talk
  • Intermediate
  • 16 upvotes
  • 1 comments
  • Wed, 9 Apr

Scaling with Queues

Rohit Yadav (@bhaisaab)

  • Full talk
  • Intermediate
  • 11 upvotes
  • 2 comments
  • Tue, 1 Apr

What chemistry can teach us about designing better NLP algorithms

Siva Prakash Kollana (@sivaprakash)

  • Crisp talk
  • Beginner
  • 17 upvotes
  • 4 comments
  • Thu, 27 Mar

Big Data in Telecom - Case studies

Siddharth Vijayvergiya (@vijayvergiya)

  • Full talk
  • Intermediate
  • 2 upvotes
  • 0 comments
  • Thu, 27 Mar

Developing Real-Time Data Pipelines with Apache Kafka

Manisha Sethi (@manishasethi)

  • Full talk
  • Advanced
  • 4 upvotes
  • 1 comments
  • Thu, 27 Mar

How to Make Big Data Real and Valuable ...

Mayur Shah (@ssmayur)

  • Crisp talk
  • Intermediate
  • -1 upvotes
  • 1 comments
  • Thu, 27 Mar

ANALYTICS ON BIG FAST DATA USING REAL TIME STREAM DATA PROCESSING ARCHITECTURE

Arvind Gopinath (@arvindo)

  • Full talk
  • Intermediate
  • 24 upvotes
  • 3 comments
  • Sun, 2 Mar

Engineering custom visualisations with advanced d3.js

Chirag Gehlot (@chiraggehlot)

  • Workshops
  • Advanced
  • 12 upvotes
  • 0 comments
  • Mon, 3 Feb

Visualizing large data sets

Puneet Mohan Sangal (@pmsangal)

  • Full talk
  • Intermediate
  • 17 upvotes
  • 1 comments
  • Thu, 30 Jan
  • slideshow