Schedule

The Fifth Elephant 2017

On data engineering and application of ML in diverse domains

Auditorium

Banquet hall

Room 1

08:15–09:30

Check-in and breakfast

09:30–09:40

Stretching session

09:30–09:40

HasGeek app demo

09:40–10:25

Lessons learned from building a globally distributed database service from the ground up.

Dharma Shukla, Microsoft

09:40–10:25

How we built our machine intelligence to help humans save lives.

Zainul Charbiwala, Tricog

10:25–11:10

Fraud detection and risk management in payment systems implemented using a hybrid memory database.

Srini V. Srinivasan, Aerospike

10:25–10:35

Stretching session

10:55–11:35

Interactive real-time dashboards on data streams using Kafka, Druid and Superset.

Nishant Bangarwa, Hortonworks

11:10–11:45

Morning beverage break (auditorium)

11:35–12:05

Morning beverage break (banquet hall)

11:45–12:00

Introductions to The Fifth Elephant and Anthill Inside; HasGeek app demo

12:00–12:45

Distributed consensus and data safety: NewSQL perspective

Vijay Srinivas Agneeswaran

12:05–12:35

Transforming India's budgets into open linked data.

Gaurav Godhwani, OpenBudgetsIndia.org and DataKind

12:35–13:05

Open data in government: challenges, and the case of Telangana Open Data initiative.

Rakesh Dubbudu, Factly

12:45–12:55

Stretching session

12:55–13:40

What database? - a practical guide to selection from NoSQL, SQL and Polyglot data stores

Regunath Balasubramanian

13:05–14:05

Lunch break (banquet hall)

13:40–14:40

Lunch break (auditorium)

14:05–14:25

Wait, I can explain this! ML models explaining their predictions.

Ramprakash R, Zoho Corporation

14:05–15:05

Off The Record (OTR) session: "Using open data in different scenarios – challenges and opportunities."

14:25–14:35

Stretching session

14:35–15:20

From a recommendations carousel to personalizing entire app: personalization story at Paytm.

Charumitra Pujari, Paytm

14:40–15:00

Q&A with speakers on data stores and databases.

Dharma Shukla, Srini Srinivasan, Vijay Sreenivas Agneeswaran and Regunath Balasubramanian

15:00–15:10

Stretching session

15:10–15:30

Distributed ML: challenges and opportunities.

Anand Chitipothu, Rorodata

15:15–16:00

Off The Record (OTR) session: "Interesting problems to solve with data science."

Led by Sritulasi E, Suchana Seth, Ashwin Kumar

15:20–15:40

Adapting bandit algorithms to optimise user experience at Practo Consult.

Santosh GSK, Practo

15:30–16:00

Suuchi: toolkit to build distributed systems.

Sriram R, Indix

15:40–16:10

Evening beverage break (banquet hall)

16:00–16:30

Evening beverage break (auditorium)

16:10–17:10

Off The Record (OTR) session: "Fairness, Accountability and Transparency (FAT) in ML."

Facilitated by Alok Prasanna Kumar with Suchana Seth, Suhaan Mukerji, Harsh Gupta, Paul Meinshausen

16:30–17:15

Machine Learning: from practice to production.

Ramanan Balakrishnan, Semantics3

17:10–17:40

Flash talks: presentations by the audience

17:15–18:15

Off The Record (OTR) session: "Data science in production."

Led by Bargava S, Anand C, Ragotham S.

18:15–22:00

The Fifth Elephant 2017 and Anthill Inside 2017 "Networking Data Scientist dinner."

Auditorium

Banquet hall

Room 1

08:15–09:30

Check-in and breakfast

09:30–10:15

Interactive data visualisation using Markdown.

Amit Kapoor, narrativeVIZ Consulting

09:30–09:40

Stretching session

09:40–10:25

Bits and joules: data-driven energy systems.

Deva P. Seetharam, Dataglen

10:15–10:35

What explains our marks?

Anand S, Gramener

10:25–11:10

Do you know what's on TV?

Bharath Mohan, Sensara.TV

10:35–11:20

Maps ❤️ Data: a voyage across the world of geo-visualization.

Rasagy Sharma, Mapbox

11:10–11:40

Morning beverage break (banquet)

11:20–11:50

Morning beverage break (auditorium)

11:40–12:25

Designing ML pipelines for mining transactional SMS messages

Paul Meinshausen, Montane Ventures

11:50–12:00

Stretching session

12:00–12:45

Off The Record (OTR) session: "Finance data analytics."

12:00–12:45

Off The Record (OTR) session: "Data Visualization."

Facilitators: S Anand, Rasagy Sharma, Amit Kapoor

12:25–12:45

Q&A with presenters on use cases

Paul Meinshausen and Bharath Mohan

12:45–13:30

Gabbar: Machine learning to guard OpenStreetMap

Bhargav Kowshik, Mapbox

12:45–13:45

Lunch break (banquet hall)

13:30–14:30

Lunch break (auditorium)

13:45–14:05

How we are building serverless architectures for Deep Learning and NLP at Episource.

Manas Ranjan Kar, Episource

13:45–14:30

Off The Record (OTR) session: "Worst disaster stories in ML."

Led by Bargava S

14:05–14:25

Plumbing data science pipelines.

Krishnapriya Satagopan, Mad Street Den

14:25–14:55

Governance using Apache Atlas: why and how.

Vimal Sharma, Hortonworks

14:30–15:15

5 Lessons I’ve Learned Tackling Product Matching for E-commerce

Govind Chandrasekhar, Semantics3

14:30–15:15

Off The Record (OTR) session: "Spark: use cases and challenges in production."

14:55–15:05

Stretching session

15:05–16:05

Off The Record (OTR) session: "Securing data stored in the cloud for big data analysis."

Anand V, Sandesh Anand, Girish Patil

15:15–15:25

Stretching session

15:25–16:25

Off The Record (OTR) session: "ML and data engineering in fleet management and logistics."

Ashwin Kumar (data scientist at Uber), Vinayak Hegde (ZoomCar), Pranav Saxena (product manager at Flipkart)

15:25–16:10

Off The Record (OTR) session: "Experiences and challenges in working with Druid."

Led by Nishant Bangarwa

16:05–16:35

Evening beverage break (banquet hall)

16:25–17:05

Evening beverage break (auditorium)

16:25–17:25

Off The Record (OTR) session: "Learning data science."

Led by Bargava S, Amit Kapoor

17:05–17:25

Augmenting Solr’s NLP capabilities with Deep Learning features to match images.

Kumar Shubham, DataWeave

17:25–18:10

Near real-time indexing/search in e-commerce marketplace: approaches and learnings.

Umesh Prasad, LucidWorks

17:25–17:40

Feedback and closing

17:40–18:25

Flash talks – presentations by the audience

18:10–18:25

Feedback and closing

Hosted by

Jump starting better data engineering and AI futures