Jul 2016
28 Thu 08:30 AM – 06:25 PM IST
29 Fri 08:30 AM – 06:15 PM IST
30 Sat 08:45 AM – 05:00 PM IST
31 Sun 08:15 AM – 06:00 PM IST
Suchana Seth
The talk will focus on ML-specific challenges in designing data science systems, how such systems acquire technical debt, and what we can do at the design level to mitigate some of the risks.
Key takeaway
Learn how to foresee these pitfalls and design your pipelines and systems to avoid them.
This talk is intended for an audience already familiar with applying machine learning algorithms.
In this talk, we’ll cover these sources of risk to ML systems:
Data drift - how to handle feature distributions that shift with time
Post-model heuristics - when and how to add heuristics to model output
Hidden downstream consumers - how to identify and plan for these
Unacknowledged data dependencies - how to identify and plan for these
Feedback loops - the good and the bad
Decision thresholds & action limits - how to keep them sane
Reproducibility - how to ensure it
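As an illustration of the first item above (data drift), here is a minimal sketch of one common way to flag a shifted feature distribution: the Population Stability Index (PSI) between a training-time sample and a recent sample. This is not taken from the talk itself; the function name, the bin count, and the 0.25 alert threshold (a widely quoted rule of thumb) are assumptions chosen for the example.

```python
import math


def population_stability_index(reference, current, n_bins=10, eps=1e-6):
    """PSI of one numeric feature: reference (training) sample vs. current sample.

    Hypothetical helper for illustration; bins are equal-width over the
    range of the reference sample.
    """
    lo, hi = min(reference), max(reference)
    if hi == lo:
        raise ValueError("reference sample has zero range")
    width = (hi - lo) / n_bins

    def bin_fractions(values):
        counts = [0] * n_bins
        for v in values:
            # Values outside the reference range are clamped into the edge bins.
            idx = int((v - lo) / width)
            idx = min(max(idx, 0), n_bins - 1)
            counts[idx] += 1
        total = len(values)
        # eps keeps empty bins from producing log(0) or division by zero.
        return [max(c / total, eps) for c in counts]

    ref = bin_fractions(reference)
    cur = bin_fractions(current)
    return sum((c - r) * math.log(c / r) for r, c in zip(ref, cur))


# Synthetic data: the "current" sample is the reference shifted by +5.
reference = [i / 100 for i in range(1000)]
shifted = [v + 5 for v in reference]

psi_same = population_stability_index(reference, reference)
psi_shifted = population_stability_index(reference, shifted)

# Assumed rule-of-thumb threshold: PSI above 0.25 suggests a significant shift.
drift_detected = psi_shifted > 0.25
```

A check like this would typically run per feature on a schedule, with an alert raised when the index crosses the chosen threshold. Other tests, such as a two-sample Kolmogorov-Smirnov test, could serve the same purpose.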
Suchana is a physicist turned data scientist with 8 years of experience in research, startups, and product labs. She volunteers with DataKind in her free time and mentors data-for-good projects.