The Fifth Elephant 2018

The seventh edition of India's best data conference

Managing Machine Learning Models in Production

Submitted by Anand Chitipothu (@anandology) on Saturday, 31 March 2018

Technical level

Intermediate

Section

Crisp talk

Status

Submitted

Abstract

Deploying machine learning models in production is not a trivial task.

The challenges include managing multiple versions of models, maintaining separate staging and production models, tracking model performance, logging, and scaling.

This session explores the tools, techniques and system architecture of a cloud platform built to solve these challenges and the new opportunities it opens up.

Outline

Typically, data scientists build machine learning models and ask IT specialists in their team to deploy them. As teams become smaller and the pressure to be more productive grows, few data science teams have the luxury of specialists at their beck and call.

Even with dedicated IT teams, managing models in production is not a trivial task. As the number of models and the size of the team increase, the complexity only grows.

How do you manage multiple versions of a model? How do you version-control the datasets used for model building? How do you tag the production and staging versions of a model? How do you switch from one version to another seamlessly, without any service disruption? How do you monitor the performance of a live model?
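To make the versioning and tagging questions concrete, here is a minimal sketch of one possible approach: an in-memory registry that assigns version numbers to models and keeps movable tags such as "production" and "staging". This is only an illustration and not the platform described in this talk; the `ModelRegistry` class and its `register`, `tag` and `get` methods are hypothetical names, and a real system would also need persistent storage, dataset versioning and monitoring.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class ModelRegistry:
    """Hypothetical in-memory registry for the versions of a single model."""

    versions: List[Any] = field(default_factory=list)
    tags: Dict[str, int] = field(default_factory=dict)

    def register(self, model: Any) -> int:
        """Store a new model and return its version number (starting at 1)."""
        self.versions.append(model)
        return len(self.versions)

    def tag(self, name: str, version: int) -> None:
        """Point a tag such as 'production' at an existing version."""
        if not 1 <= version <= len(self.versions):
            raise ValueError(f"unknown version {version}")
        # Moving a tag is a single assignment, so a caller resolving the
        # tag gets either the old version or the new one.
        self.tags[name] = version

    def get(self, name: str) -> Any:
        """Return the model a tag currently points to."""
        return self.versions[self.tags[name] - 1]


# Plain functions stand in for trained models in this sketch.
registry = ModelRegistry()
v1 = registry.register(lambda x: x * 2)
v2 = registry.register(lambda x: x * 3)

registry.tag("production", v1)
registry.tag("staging", v2)
before = registry.get("production")(10)

# Promote the staging version to production by moving the tag.
registry.tag("production", v2)
after = registry.get("production")(10)
```

Because callers always look a model up by tag, promoting or rolling back a version means moving a tag rather than redeploying the serving code.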

This session explores the tools, techniques and system architecture used to build a cloud-based platform that addresses all of the above issues, illustrated with a couple of case studies.

Takeaways

  • Understand the complexities involved in building & managing machine learning products
  • How these complexities change as the team size grows
  • Understand the importance of investing early in the appropriate tools
  • Techniques, tools and ideas that can be adopted in any typical ML workflow

Speaker bio

Anand has been crafting beautiful software for a decade and a half. He’s now building a data science platform, rorodata, which he recently co-founded. He regularly conducts advanced programming courses through Pipal Academy. He is co-author of web.py, a micro web framework in Python. He has worked at Strand Life Sciences and Internet Archive.

Slides

https://speakerdeck.com/anandology/mananging-machine-learning-models-in-production-strata-singapore-2017

Comments

  • 1
    Zainab Bawa (@zainabbawa) Reviewer 8 months ago

    The first part of the talk lays out the problem space very nicely. The second part of the talk becomes a pitch for RoRoData since it goes into the solution space. How do we deal with this?

  • 1
    Anand Chitipothu (@anandology) Proposer 8 months ago

    That is a problem with being in a product business. I’m not sure where to draw the line. Happy to discuss.
