Make a submission

Accepting submissions till 15 Jun 2019, 01:00 PM

NIMHANS Convention Centre, Bengaluru


##The eighth edition of The Fifth Elephant will be held in Bangalore on 25 and 26 July. A thousand data scientists, ML engineers, data engineers and analysts will gather at the NIMHANS Convention Centre in Bangalore to discuss:

  1. Model management, including data cleaning, instrumentation and productionizing data science.
  2. Bad data and case studies of failure in building data products.
  3. Identifying and handling fraud, and data security at scale.
  4. Applications of data science in agriculture, media and marketing, supply chain, geo-location, SaaS and e-commerce.
  5. Feature engineering and ML platforms.
  6. What it takes to create data-driven cultures in organizations of different scales.

##Highlights:

1. Meet Peter Wang, co-founder of Anaconda Inc, and learn about why data privacy is the first step towards robust data management; the journey of building Anaconda; and Anaconda in enterprise.
2. Talk to the Fulfillment and Supply Group (FSG) team from Flipkart, and learn about their work with platform engineering where ground truths are the source of data.
3. Attend tutorials on Deep Learning with RedisAI and on TransmogrifAI, Salesforce’s open source AutoML library.
4. Discuss interesting problems to solve with data science in agriculture, a SaaS perspective on multi-tenancy in Machine Learning (with the Freshworks team), and bias in intent classification and recommendations.
5. Meet data science, data engineering and product teams from sponsoring companies to understand how they are handling data and leveraging intelligence from data to solve interesting problems.

##Why should you attend?

  1. Network with peers and practitioners from the data ecosystem
  2. Share approaches to solving expensive problems such as cleanliness of training data, model management and versioning data
  3. Demo your ideas in the demo session
  4. Join Birds of Feather (BOF) sessions to have productive discussions on focussed topics. Or, start your own Birds of Feather (BOF) session.

##Full schedule published here: https://hasgeek.com/fifthelephant/2019/schedule

##Contact details:
For more information about The Fifth Elephant, sponsorships, or anything else, call +91-7676332020 or email info@hasgeek.com.

#Sponsors:

Sponsorship Deck.
Email sales@hasgeek.com for bulk ticket purchases and for sponsoring the 2019 edition of JSFoo:VueDay.

JSFoo:VueDay 2019 sponsors:

#Platinum Sponsor

Anatta

#Community Sponsors

Salesforce Ericsson freshworks
databricks

#Exhibition Sponsors

Sapient Atlassian GO-JEK
Bayer

#Bronze Sponsor

Sumologic Walmart Labs Atlan
Simpl Great Learning

#Community Sponsors

Elastic Anaconda Aruba Networks

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conferences in Asia - has transitioned into a year-round forum for conversations about data and ML engineering, data science in production, and data security and privacy practices.

archit agarwal

@arcagarwal

Turning Data into Actionable Insights in Real Time

Submitted Jun 15, 2019

This talk shares our learnings and best practices from building a data pipeline that handles billions of events per day with single-digit latency (in seconds). It covers how we moved from Spring microservices to the Akka framework, and how that move reduced our VM footprint by 85%. Data volumes have grown hugely in recent years, and using Spring was not scalable.

I will explain how the PayPal analytics pipeline processes billions of events, which tech stack we use to achieve this, and how we process the data so that it is useful for making decisions. The talk runs from how we acquire the data, through how we process it, to visualization, and describes how we use Kafka, Spark and Druid (Open Stack) in our ecosystem. It should help anyone who is new to building data processing pipelines in their organization.
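For a rough sense of scale, the arithmetic behind "billions of events per day" can be sketched as follows. The figure of one billion events per day and the even spread of traffic over 24 hours are illustrative assumptions, not numbers from the talk.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def average_events_per_second(events_per_day: float) -> float:
    """Average sustained rate, assuming traffic is spread evenly over the day."""
    return events_per_day / SECONDS_PER_DAY


# Assumption: 1 billion events/day, the low end of "billions".
rate = average_events_per_second(1_000_000_000)
print(f"{rate:,.0f} events/second on average")
```

Under these assumptions the average works out to a little over 11,500 events per second. Real traffic is rarely uniform, so peak rates would likely be higher.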

Outline

The flow would look like this:
1- About PayPal
2- Introduction to the Real Time Analytics Pipeline
3- How we acquire data from the PayPal site (ClickStream Analytics) using the Akka framework.
4- Messaging layer -> How we use Kafka.
5- How we process the data in real time using Spark Streaming, including the storage file format we use to save space.
6- Visualization of the data so that analysts can draw meaningful insights from it, and how Druid helps us load dashboards within seconds.
7- Connecting all the dots
8- Takeaways
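
The stages in the outline above (acquisition, messaging, real-time processing, querying for dashboards) can be illustrated with a minimal sketch in plain Python. It is not the PayPal implementation and does not use the Akka, Kafka, Spark or Druid APIs. The names `ClickEvent`, `acquire`, `aggregate` and `top_pages` are hypothetical, the sample events are made up, and an in-memory `deque` stands in for the messaging layer.

```python
from collections import Counter, deque
from dataclasses import dataclass


@dataclass(frozen=True)
class ClickEvent:
    timestamp: int  # seconds since epoch
    page: str


def acquire(raw_events):
    """Acquisition step: validate raw input and turn it into ClickEvent objects."""
    for ts, page in raw_events:
        if page:  # drop events that carry no page
            yield ClickEvent(timestamp=ts, page=page)


def aggregate(events, window_seconds=10):
    """Processing step: count page views per tumbling time window."""
    counts = Counter()
    for event in events:
        window_start = event.timestamp - event.timestamp % window_seconds
        counts[(window_start, event.page)] += 1
    return counts


def top_pages(counts, window_start, n=3):
    """Query step: the most-viewed pages in one window, as a dashboard might show."""
    in_window = Counter(
        {page: c for (w, page), c in counts.items() if w == window_start}
    )
    return in_window.most_common(n)


# Made-up sample events; the deque stands in for the messaging layer.
raw = [(100, "/home"), (103, "/pay"), (104, "/home"), (111, "/pay"), (112, "")]
message_queue = deque(acquire(raw))
counts = aggregate(message_queue, window_seconds=10)
print(top_pages(counts, window_start=100))
```

Keeping each stage as a separate function mirrors the separation the outline describes, where each layer could be scaled or replaced independently.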

Speaker bio

I have been working as a Senior Data Engineer at PayPal for one year. I am passionate about building scalable systems software (billions of requests with single-digit latency). In my spare time I read the tech blogs of different companies and play badminton.

Slides

https://www.slideshare.net/secret/BHUFD3HIRUMQqZ

