The Fifth Elephant For members

The Fifth Elephant 2023 Monsoon

On AI, industrial applications of ML, and MLOps

Make a submission

Accepting submissions till 04 Jul 2023, 12:30 PM

Bangalore International Centre (BIC), Bengaluru

Tickets

Loading…

The Fifth Elephant 2023 Monsoon Edition event recap is now up here . The event was attended by 192 participants, of which one-fourth were women. The Fifth Elephant videos are available to watch here

Event highlights:




Editors

The 2023 Monsoon edition is curated by:

  1. Nischal HP, Vice President of Data Engineering and Data Science at Scoutbee. Nischal curated the MLOps conference which was held online between 23 and 27 July 2021.
  2. Sumod Mohan, Founder and CEO at AutoInfer. Sumod curated Anthill Inside 2019 edition, held in Bangalore on 23 November.

Tracks and themes

  1. AI and Research - covers research, findings, and solutions for challenges on building models in various areas such as fraud detection, forecasting, and analytics. This track delves into the latest methodologies for handling challenges such as large-scale data processing, distributed computing, and optimizing model performance.
  2. Industrial applications of ML - covers implementation of AI in the industry, with more focus on the AI models, the issues in training, gathering data so, and so forth. ML is being used at scale in industries such as automotive, mechanical, manufacturing, agriculture, and such domains. This track focuses on the challenges in this space, as we see innovation coming out of these industries in the pursuit of using ML on a second-to-second basis.
  3. AI and Product - covers strategies for building AI products to scale and mitigating challenges. This track provides insights on incorporating AI tools and forecasting techniques to improve model training, developing a working model architecture, and using data in the business context.

There are three phases in the lifecycle of an application - research, application and aftermath of the application.

  1. Assess capabilities, determining the new frontiers for AI.
  2. Find a use for the application.
  3. Learn how to run it, monitor it and update it with time.

The three tracks at the 2023 Monsoon edition of The Fifth Elephant will cover this lifecycle.

Members-only conference

The Fifth Elephant 2023 Monsoon edition will be held in-person. Attendance is open to The Fifth Elephant members only. Pick a membership to attend the in-person conference. If you have questions about participation, post a comment here.

Who will benefit from participating in The Fifth Elephant community:

  1. Data/MLOps engineers who want to learn about state-of-the-art tools and techniques, especially from domains such as automobile, agri-tech and mechanical industries.
  2. Data scientists who want a deeper understanding of model deployment/governance.
  3. Architects who are building ML workflows that scale.
  4. Tech founders who are building products that require AI or ML.
  5. Product managers, who want to learn about the process of building AI/ML products.
  6. Directors, VPs and senior tech leadership who are building AI/ML teams.

Sponsorship

Sponsorship slots are open for:

  1. Infrastructure (GPU, CPU and cloud providers) and developer productivity tool makers who want to evangelise their offering to developers and decision-makers.
  2. Companies seeking tech branding among AI and ML developers.
  3. Venture Capital (VC) firms and investors who want to scan the landscape of innovations and innovators in AI and who want to source leads for investment in the AI and ML space.

Contact information

Join the @fifthel Telegram group or follow @fifthel on Twitter. For any inquiries, call Hasgeek at +91 7676 33 2020.

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Supported by

India’s Top Advanced Cloud GPU Provider. H100, A100, L4, A40,& A30 Sign up here: https://bit.ly/vc_desk more

Dhruv Nigam

@dhruvn

Predicting customer lifetime value in a non-contractual digital commerce setting

Submitted Jun 20, 2023

At Dream 11, we have built a Customer lifetime value(CLTV) model to predict each user’s future lifetime value. There are two broad areas where having a future-looking estimate of customer value can help.

Personalization

Having user-level customer lifetime value predictions enables us to personalize our platform for each user including personalized marketing campaigns, discounts, and recommendations. Eg. Users having a high predicted value might be offered higher discounts to improve retention since we will likely recover the costs of the higher discounts.

A proxy metric for long-term value

Experiments are central to our culture. However, we can only run each experiment for a finite time. Often we are interested in how short-term interventions affect long-term metrics. in extreme cases, certain features might improve short-term metrics while degrading long-term customer value. Change in future predicted lifetime value helps establish whether a certain treatment had a meaningful long-term impact on the target versus the control group.

Technical challenges faced

In a fast-moving e-commerce setting where user behavior is dynamic and there is an inherent seasonality to the business, getting long-term unbiased estimates at a user level is almost impossible. However, the most important use cases for CLTV estimates that do not necessitate accurate predictions are the user level. Some, like personalization, can work reasonably well even if we can predict the cross-sectional ranks of users. Other use cases where we want to assess the impact of a feature on long-term value rely only on cohort-level estimates which can be much more accurate because of variance reduction assuming i.i.d. errors for user-level estimates.

One of the biggest challenges we faced was the distribution of our target variable. We chose the future cumulative Contest Entry amount(CEA, roughly equivalent to revenue) over 360 days as our target variable. CEA, being a monetary metric, is highly skewed. Its distribution exhibits two key features-

  • A fat tail indicated a few users who contribute abnormally high CEA
  • A concentrated probability mass at 0 represents users who have churned out of the system and contribute exactly zero CEA.
  • Seasonality. Being in the fantasy space, user activity is heavily clustered around major sporting events

These distributional quirks are not unique to Dream11. Almost any e-commerce business with non-contractual(not subscription) customer engagement will display similar distributions.

Conventional evaluation metrics and loss functions fall short of capturing the nuances of this distribution that we call zero-inflated log-normal distribution. We address these challenges using customer loss functions like Tweedie loss and evaluation metrics like normalized Gini index.

Dream11 currently has close to ~80 million paying users. The scale of data to be processed for feature engineering, model training, and inference presents another challenge. We used spark to scale training and inference to meet the scale requirements.

Adoption and usability

We demonstrate that using this single model, how we were able to-

  • personalizing treatments - specifically marketing and promotions, driving higher retention and ROI
  • establishing the quality of marketing channels(google, Facebook, etc.) based on the lifetime value of users acquired through it - enabling quick feedback for optimizing acquisition spending.
  • quantifying the long-term impact of new product features beyond the experimentation phase.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Make a submission

Accepting submissions till 04 Jul 2023, 12:30 PM

Bangalore International Centre (BIC), Bengaluru

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Supported by

India’s Top Advanced Cloud GPU Provider. H100, A100, L4, A40,& A30 Sign up here: https://bit.ly/vc_desk more