The Fifth Elephant 2023 Monsoon

On AI, industrial applications of ML, and MLOps

Accepting submissions till 04 Jul 2023, 12:30 PM

Bangalore International Centre (BIC), Bengaluru


The event recap for The Fifth Elephant 2023 Monsoon edition is now available. The event was attended by 192 participants, of whom one-fourth were women. Videos from The Fifth Elephant are available to watch.

Editors

The 2023 Monsoon edition is curated by:

  1. Nischal HP, Vice President of Data Engineering and Data Science at Scoutbee. Nischal curated the MLOps conference which was held online between 23 and 27 July 2021.
  2. Sumod Mohan, Founder and CEO at AutoInfer. Sumod curated the 2019 edition of Anthill Inside, held in Bangalore on 23 November.

Tracks and themes

  1. AI and Research - covers research, findings, and solutions for challenges in building models in areas such as fraud detection, forecasting, and analytics. This track delves into the latest methodologies for handling large-scale data processing, distributed computing, and model performance optimization.
  2. Industrial applications of ML - covers the implementation of AI in industry, with a focus on the models themselves, issues in training, data gathering, and so forth. ML is being used at scale in industries such as automotive, mechanical, manufacturing, and agriculture. This track focuses on the challenges in this space, as innovation emerges from these industries in the pursuit of using ML on a second-to-second basis.
  3. AI and Product - covers strategies for building AI products to scale and mitigating challenges. This track provides insights on incorporating AI tools and forecasting techniques to improve model training, developing a working model architecture, and using data in the business context.

There are three phases in the lifecycle of an application - research, application and aftermath of the application.

  1. Assess capabilities, determining the new frontiers for AI.
  2. Find a use for the application.
  3. Learn how to run it, monitor it and update it with time.

The three tracks at the 2023 Monsoon edition of The Fifth Elephant will cover this lifecycle.

Members-only conference

The Fifth Elephant 2023 Monsoon edition will be held in person. Attendance is open to The Fifth Elephant members only. Purchase a membership to attend the conference in person. If you have questions about participation, post a comment here.

Who will benefit from participating in The Fifth Elephant community:

  1. Data/MLOps engineers who want to learn about state-of-the-art tools and techniques, especially from domains such as automobile, agri-tech and mechanical industries.
  2. Data scientists who want a deeper understanding of model deployment/governance.
  3. Architects who are building ML workflows that scale.
  4. Tech founders who are building products that require AI or ML.
  5. Product managers who want to learn about the process of building AI/ML products.
  6. Directors, VPs and senior tech leadership who are building AI/ML teams.

Sponsorship

Sponsorship slots are open for:

  1. Infrastructure (GPU, CPU and cloud providers) and developer productivity tool makers who want to evangelise their offering to developers and decision-makers.
  2. Companies seeking tech branding among AI and ML developers.
  3. Venture Capital (VC) firms and investors who want to scan the landscape of innovations and innovators in AI and who want to source leads for investment in the AI and ML space.

Contact information

Join the @fifthel Telegram group or follow @fifthel on Twitter. For any inquiries, call Hasgeek at +91 7676 33 2020.

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conferences in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices.

Supported by

India’s Top Advanced Cloud GPU Provider. H100, A100, L4, A40, & A30. Sign up here: https://bit.ly/vc_desk

Meghana Negi

@meghana_n

Solving for explainability of fraud detection models

Submitted Jun 16, 2023

Problem:
At the TnS (Trust and Safety) team at Swiggy, the overarching goal has been to build powerful fraud detection models that operate at high precision while still capturing as much fraud as possible. Our system currently operates at a high level of complexity through various interventions, modelling techniques, and semi-supervised training methods while maintaining robustness.
For the final downstream model, we have always relied on tree-based learners over neural networks. Since our data is primarily tabular, tree-based learners outperformed DNNs significantly on the metrics that matter to us. While tree-based learners perform well on the final metrics we are looking to optimise, they come with a few challenges:
1. They inherently restrict us from using more complex data structures such as images or sequential data. We have tried to integrate such signals through a separate model whose final score is fed into the tree-based learner, but this adds significantly to the complexity of the system.
2. A major pain point for fraud models has historically been the lack of explainability in predictions. We have experimented with LIME- and SHAP-based approaches to build an explainability layer on top of the model, but they are computationally expensive to run for each record.

Solution:
While tree-based methods for a deployable model have all these challenges, what works in their favour is that they have historically outperformed DL-based methods by a significant margin. This changes with TabNet. In the original paper (Ref), the authors claim that TabNet can match or even outperform tree-based methods while also giving sample-level explainability, which can be visualised. We explored a TabNet-based model for our approach and found it to be on par with its tree-based counterpart (XGBoost). TabNet also allowed us to compute and store feature-level attention within the model logs without any additional computational overhead.
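To make the idea of sample-level, feature-level attention more concrete, here is a minimal pure-Python sketch of how TabNet-style masks can be turned into a per-record feature attribution. It is an illustration under stated assumptions, not the pipeline described in this talk: the function names `sparsemax` and `aggregate_masks` are hypothetical, and the mask logits and decision-step outputs are made-up numbers passed in directly, whereas in a trained TabNet they come from the network's attentive and feature transformers. The sketch only shows why the attribution is cheap: the masks are produced during the forward pass, so aggregating them is a small weighted sum per record.

```python
from typing import List


def sparsemax(logits: List[float]) -> List[float]:
    """Project logits onto the probability simplex.

    Unlike softmax, the result can contain exact zeros, which is what
    makes the resulting feature masks sparse and readable.
    """
    z_sorted = sorted(logits, reverse=True)
    cumulative = 0.0
    tau = 0.0
    for k, z_k in enumerate(z_sorted, start=1):
        cumulative += z_k
        # The support is the largest k for which this inequality holds.
        if 1.0 + k * z_k > cumulative:
            tau = (cumulative - 1.0) / k
    return [max(z - tau, 0.0) for z in logits]


def aggregate_masks(
    masks: List[List[float]], step_outputs: List[List[float]]
) -> List[float]:
    """Combine per-step feature masks into one attribution for a record.

    masks[i][j] is the weight step i places on feature j.
    step_outputs[i] is the decision output vector of step i; the sum of
    its ReLU'd entries is used as that step's contribution weight.
    """
    n_features = len(masks[0])
    step_weights = [sum(max(d, 0.0) for d in out) for out in step_outputs]
    aggregated = [
        sum(w * mask[j] for w, mask in zip(step_weights, masks))
        for j in range(n_features)
    ]
    total = sum(aggregated)
    # Normalise so the attributions for one record sum to 1.
    return [a / total for a in aggregated] if total > 0 else aggregated


if __name__ == "__main__":
    # Hypothetical logits for three features over two decision steps.
    masks = [sparsemax([2.0, 1.0, 0.1]), sparsemax([1.0, 0.8, 0.1])]
    # Hypothetical decision outputs for the same two steps.
    step_outputs = [[1.0, -0.5], [0.5, 0.5]]
    print(aggregate_masks(masks, step_outputs))
```

The aggregated vector for a record can be written to the model logs alongside the prediction, in contrast to LIME or SHAP, which require additional model evaluations per record.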

Outline:
In the presentation, we will cover the following in depth:
1. Current pipeline and solution
2. Challenges in depth
3. Motivation for TabNet and what it unlocks
4. Experimental results and conclusion
