Anthill Inside 2019

On infrastructure for AI and ML: from managing training data to data storage, cloud strategy and costs of developing ML models

Production Object Detection - A Journey of Training, Building and Deploying CV models

Submitted by Tarang Shah (@tarang27) on Saturday, 20 October 2018

Technical level: Beginner

Abstract

Computer Vision as a field has changed manifold in the past few years. Researchers publish their papers and at times their code for the latest algorithms, but the challenge for the industry remains in applying that research to their processes.
Customising a company’s proprietary data for the research models, implementing their code, and training models is the first big hurdle. Then comes the part where we have to test and release these latest models to production.
In this talk we will go through a project where we did exactly the above at Here Technologies. The audience will learn abot the main issues we faced, how we overcame it and other best practises, including optimising AWS infrastructure for Machine Learning DevOps.

Outline

  1. Overview of object detection approaches
  2. Training Data Prep - Including handling data on the cloud
    1. Data collection - Sampling
    2. Annotation and review approaches - human, automated
  3. Actually training the model - hardware/cloud/best practices
    1. Troubles with large data sets, how to deal with issues when you hit the limit of state of the art hardware
  4. Evaluation of the model results
  5. Double checking the evaluation - blind test dataset
  6. Release and integration with systems
  7. Deployment and Infrastructure

Requirements

None as such. For the talk, the only pre req is basic knowledge of machine learning terminology.

Speaker bio

I’m an engineer involved in computer vision and robotics since 5+ years. I have worked on various computer vision and data science projects including an autonomous soccer playing humanoid(acyut.com), OCR(text extraction/transcription) and object detection models. As a computer vision and data science practitioner who has faced and overcome challenges in production systems, it would be great to share some of that knowledge for the benefit of the community.

Links

Comments

Login with Twitter or Google to leave a comment