The Python ecosystem for data science - Landscape Overview

Jul 2017

24 Mon

25 Tue

26 Wed

27 Thu 08:15 AM – 10:00 PM IST

28 Fri 08:15 AM – 06:25 PM IST

29 Sat

30 Sun

MLR Convention Centre, Whitefield, Bengaluru,

The Python ecosystem for data science - Landscape Overview

Submitted Apr 27, 2017

Section: Full talk for data engineering track Technical level: Beginner

In their day-to-day jobs, data science teams and data scientists face challenges in many overlapping yet distinct areas such as Reporting, Data Processing & Storage, Scientific Computing, ML Modelling, Application Development. To succeed, Data science teams, especially small ones, need a deep appreciation of these dependencies on their success.

Python ecosystem for data science has a number of tools and libraries for various aspects of data science, including Machine Learning, Cluster Computing, Scientific Computing, etc.

The idea of this talk is to understand what the Python data science ecosystem offers (so that you don’t reinvent it), what are some common gaps (so that you don’t go blue looking for answers).

In this talk, we describe how different tools/libraries fit in the machine learning model development and deployment workflow . This talk is about how these different tools work (and don’t work) together with each other. It is intended as a landscape survey of the python data science ecosystem, along with a mention of some common gaps that practitioners may notice as they put together a stack and/or an application for their company.

Outline

Evolving Role of Data Science Teams
Machine Learning vs Real World Data Science
Challenges faced by Data Science Teams
Data Science Workflow
Python Ecosystem
Review of Key Tools
Use Cases
What Works
Gaps from a practitioner viewpoint

Requirements

Machine learning practitioners, startups, new data science teams

Speaker bio

Ananth Krishnamoorthy Ph.D. specializes in applying analytical techniques based on mathematical optimization, machine learning, discrete event simulation, and time series analysis, to real world business problems across various industry sectors. He has delivered several business consulting, analytical solution development, and technology implementation projects over the last 17 years.

Ananth is the co-founder of rorodata, a startup that is building a cloud based data science platform. He is also head of Hypercube Analytics, an analytics consulting company. Ananth holds a Ph.D. in Industrial Engineering and Management from Oklahoma State University

Slides

https://www.slideshare.net/ananthkrishnamoorthy/the-python-ecosystem-for-data-science-landscape-overview

The Fifth Elephant 2017