Submissions
Submit a talk on data

Submit a talk on data

Submit talks on data engineering, data science, machine learning, big data and analytics through the year – 2019

Accepting submissions till 31 Dec 2020, 11:59 PM

Not accepting submissions

Submit a talk on data.

Akash Tandon

Making sense of messy data to track disease outbreaks in India

In spite of open data portals cropping up across multiple domains, working with the datasets they provide is difficult. In our bid to identify disease outbreaks and aid preventive health-care, we came across one such data source. more
  • 7 comments
  • Rejected
  • 09 Apr 2018
Technical level: Intermediate

Aravind Putrevu

A NoSQL search engine to search ^ H ^ H ^ H ^ H ^ H ^ H find ...

Are you still searching within your data with some SELECT * FROM person WHERE name like ‘%aravind%putrevu%’? more
  • 7 comments
  • Under evaluation
  • 11 Apr 2018
Technical level: Beginner

Aravind Putrevu

Machine Learning with Elasticsearch

As volumes of data increase, manually searching and visualising consumer or user behaviours becomes more and more difficult. An alternative approach is to use machine learning to automatically build behavioural models of these behaviours. more
  • 8 comments
  • Under evaluation
  • 11 Apr 2018
Technical level: Intermediate

Kalpit Desai

Video thumbnail

The Catalog as a Catalyst - Bringing benefits of Big Data to MSMEs

While large enterprises have the necessary resources to acquire and process Big Data, the Micro / Small / Medium enterprises in emerging economies like India are far from being ‘data-driven’. This is a huge opportunity untapped, considering that MSMEs account for more than 99% of businesses, and they make up the backbone of our economy. For the opportunity to be leveraged, a crucial pre-requisite… more
  • 0 comments
  • Awaiting details
  • 12 Apr 2018
Technical level: Advanced

Kalpit Desai

Video thumbnail

The Catalog as a Catalyst - Bringing benefits of Big Data to MSMEs

While large enterprises have the necessary resources to acquire and process Big Data, the Micro / Small / Medium enterprises in emerging economies like India are far from being ‘data-driven’. This is a huge opportunity untapped, considering that MSMEs account for more than 99% of businesses, and they make up the backbone of our economy. For the opportunity to be leveraged, a crucial pre-requisite… more
  • 3 comments
  • Awaiting details
  • 12 Apr 2018
Technical level: Advanced

Upendra Singh

Video thumbnail

How organizations can leverage 'Large Scale Graph Based Analytics’ to derive value from their data.

An organization’s data is like a living organism - growing, expanding and evolving over time to form complicated and connected systems. This is similar to biological evolution, where life forms evolved from simple unicellular structures to more and more complex multicellular organisms. And as organizations compile more and more data, it is crucial for them to understand that the value of any data… more
  • 2 comments
  • Awaiting details
  • 12 Apr 2018
Technical level: Advanced

Vamsi A

Preparing AWS instances for performance and resource intensive application

The ASR module of Samsung’s Bixby does the Speech to Text translation. The accuracy of the output depends heavily on the quality of models involved. more
  • 2 comments
  • Awaiting details
  • 17 Apr 2018
Technical level: Beginner

Abhijith Chandraprabhu

Introductory workshop on Computational Machine Learning

You have been hearing about machine learning (ML) and artificial intelligence (AI) everywhere. You have heard about computers recognizing images, generating speech, natural language, and beating humans at Chess and Go. In this workshop, you will be learning the math and program the math of AI, for example we start by hand coding all the components of a neural network, without calling any librarie… more
  • 6 comments
  • Submitted
  • 01 May 2018
Technical level: Intermediate

Sampriti Das

How to eat an elephant?

Target Corporation is a global retail organization, has around 1800 stores & a digital presence, with over 350k team members. Given the scale and scope of the organization, it is even more important to have appropriate automated management of data in place. In this paper, we would share a story about how an ETL process, primarily used for Target’s Identity and Access Management (IAM) governance w… more
  • 3 comments
  • Awaiting details
  • 04 May 2018
Technical level: Beginner

Sanket Sudake

Machine Learning on Kubernetes using Kubeflow

Deploying applications with containers is now a de-facto standard & Kubernetes is a preferred orchestrator for deploying containers. Using kubernetes to build/train/deploy machine Learning application is desired considering out-of-box feature which kubernetes provides like autoscaling, self-healing, rolling upgrade support etc. more
  • 1 comment
  • Submitted
  • 16 May 2018
Technical level: Intermediate

Varun Khandelwal

Solving real world optimization problems using AI & ML

In this session i want to talk about how I have used ML, AI & IoT to solve real world complex business problems. Some of the problems I have worked on are: more
  • 3 comments
  • Waitlisted
  • 26 Mar 2018
Technical level: Advanced

Amir Nagri

Realtime System Uptime Tracking for 2M req/min

@GO-JEK, we serve more than 3M+ customers daily. This involves rigorous discipline around system reliability and availability. One of the tools that helps us achieve this goal is our Realtime System Uptime Tracking. more
  • 2 comments
  • Awaiting details
  • 26 Feb 2018
Technical level: Intermediate

Vidyasagar Nallapati

Serverless Machine Learning Platform

A practical, architectural talk on a scalable, platform for machine learning as a service built on the principles of serverless architecture more
  • 1 comment
  • Submitted
  • 30 Apr 2018
Technical level: Intermediate

SUNIL KUMAR

Video thumbnail

Introduction to Deep Learning using TensorFlow

The hands-on workshop will cover basics on machine and deep learning. It will also include custom training of TensorFlow. The attendees will learn how to use deep learning tools to construct prototype of real world applications in multiple domains. more
  • 14 comments
  • Waitlisted
  • 20 Mar 2018
Technical level: Intermediate

Ashish Mukherjee

Crisptalk - Rule-based SMS analysis for Credit Risk Insights

The lending space is rapidly growing in the fintech arena in India. The challenge faced by this sector is operational scaling of the business to an unprecedented level without adding large amount of manpower. In order to meet this challenge, the traditional process must be replaced in part or whole by digital data sources, one of which is SMS. The solution designed is able to analyse SMS data and… more
  • 11 comments
  • Awaiting details
  • 12 Jun 2018
Technical level: Intermediate

Chandulal Kavar

From Application Developer to Big Data Engineer

In this talk, I will talk about my journey from working as an App Dev to a Data Engineer at ThoughtWorks. This transition helped me to change my mindset around Big Data as well as problem-solving techniques required for distributed computing problems. I will also cover how one API over other will affect the performance and memory usage. Furthermore, I will show some of these techniques through so… more
  • 2 comments
  • Awaiting details
  • 13 Mar 2018
Technical level: Beginner

Amit Kapoor

Architectural Design for Interactive Visualization

Visualisation for data science requires an interactive visualisation setup which works at scale. In this talk, we will explore the key architectural design considerations for such a system and illustrate using examples the four key tradeoffs in this design space - rendering for data scale, computation for interaction speed, adaptive to data complexity and responsive to data velocity. The key take… more
  • 1 comment
  • Submitted
  • 28 Jun 2018
Technical level: Beginner

ravi teja

When NLP meets Analytics @ Sentienz

“What’s my best selling product in India in March?” more
  • 1 comment
  • Submitted
  • 03 Jul 2018
Technical level: Intermediate

Raghunandh GS

Mapping Movies

We are a movie-loving nation. In our country, approximately 2.5 Billion movie tickets are being sold every year. We grew up watching movies and we bring up a lot of movie references in our day to day conversations. But have we ever thought of exploring and understanding movies through data? This talk will revolve around the worderful world of movies, data and design. more
  • 16 comments
  • Under evaluation
  • 01 Aug 2018
Technical level: Intermediate

Anusha Venkat

How does big data & data insights drive business. New trends in enterprise data services

How does big data & the insights drive Business for PayPal, JIO , eBay, etc. What are the new trends that provide data insights that drives business towards success. Key performance indicators and the actions surrounding them. more
  • 1 comment
  • Submitted
  • 30 Aug 2018
Technical level: Intermediate

Abhishek Narain

Enabling modern data integration in Azure cloud

This session walks through a comprehensive set of new additions to Azure Data Factory (ADF) and SQL Server Integration Services (SSIS) for moving and integrating data across on-premises and cloud. Topics and examples focus on the needs of the data integrator and data engineer for data warehousing with Big Data, business intelligence, and advanced analytics in SaaS applications. Learn how Microsof… more
  • 10 comments
  • Awaiting details
  • 02 Nov 2018
Technical level: Intermediate

Satish Gopalani

Generating Data Analytics Reports using Scalable Config Driven Framework

Generating a prolific number of Analytics Reports from 100’s of different dimensions and metrics for customers and internal stakeholders has been a critical work of BigData Analytics team at PubMatic. Writing custom jobs to provide analytic reports, leads to repetitive efforts and redundancy of business logic in many different jobs. Another challenge is scaling the platform which already processe… more
  • 14 comments
  • Awaiting details
  • 04 Sep 2018
Technical level: Intermediate

sanjiv soni

Swing and a Miss: Deploying machine learning models for IoT enabled devices using Python

The primary purpose of this talk to describe how we are using python and sklearn to model and analyse time series sensor data. In particular, I will walk through how we use Python to process data from an IoT enabled sensor attached to a cricket bat, build machine learning models on the data, and use open source tools to deploy our models in the sensor device as a smart IoT application. more
  • 8 comments
  • Awaiting details
  • 04 Sep 2018
Technical level: Beginner

Aravind Putrevu

Elasticsearch : Distributed System for Large Scale Data needs

Often late, data explosion is a normal thing with any business. Developers collect, contain and categorize data for various use cases. more
  • 229 comments
  • Under evaluation
  • 16 Oct 2018
Technical level: Beginner

Keerthi Prasad

End-to-end automated data science process using Airflow.

Evive is a data driven benefit navigator. We provide our 25+ million users with personalised recommendations on their health and wealth. We have 50+ models running on a daily basis for the recommendations. We receive around 500+ gigabytes of data coming from 30+ different sources, on a daily basis. more
  • 5 comments
  • Under evaluation
  • 15 Oct 2018
Technical level: Beginner

Aiko Klostermann

The Deep Learning Showdown: How to pick the right tool for the job?

When you have a data centric problem to solve and you look for a technology to support you with this: The machine intelligence landscape can be overwhelming. I analysed the landscape using a data driven approach and condensed the outcome into a consumable from. Additionally I came to the conclusion that there is a set of questions you have to ask yourself to make the best possible choice for your… more
  • 8 comments
  • Under evaluation
  • 01 Feb 2019
Section: Full talk Technical level: Intermediate

Atl Arredondo

Data Lineage Service at Slack

At Slack, the data engineering team has built tools that allow engineers and other people in the company to create their own data pipelines, run interactive queries and build dashboards. Over time the data volume, the number of datasets and the dependencies between them has increased. This has made data discovery hard and impacted the reliability and trust of our datasets. In addition, incidents … more
  • 2 comments
  • Submitted
  • 01 Jun 2019
Session type: Full talk of 40 mins

Hosted by

Jump starting better data engineering and AI futures