The Fifth Elephant 2020 edition

The Fifth Elephant 2020 edition

On data governance, engineering for data privacy and data science

The ninth edition of The Fifth Elephant will be held in Bangalore on 16 and 17 July 2020.

The Fifth Elephant brings together over one thousand data scientists, ML engineers, data engineers and analysts to discuss:

  1. Data governance
  2. Data privacy and engineering for privacy including engineering for Personal Data Protection (PDP) bill.
  3. Data cleaning, annotation, instrumentation and productionizing data science.
  4. Identifying and handling fraud + data security at scale
  5. Feature engineering and ML platforms.
  6. What it takes to create data-driven cultures in organizations of different scales.

**Event details:

Dates: 16-17 July 2020
Venue: NIMHANS Convention Centre, Dairy Circle, Bangalore

Why you should attend:

  1. Network with peers and practitioners from the data ecosystem.
  2. Share approaches to solving expensive problems such as cleanliness of training data, annotation, model management and versioning data.
  3. Demo your ideas in the demo sessions.
  4. Join Birds of Feather (BOF) sessions to have productive discussions on focussed topics. Or, start your own Birds of Feather (BOF) session.

Contact details:
For more information about The Fifth Elephant, call +91-7676332020 or email sales@hasgeek.com


Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Pranjal Sanjanwala

@pranjalsanjanwala

Solving for Bias In E-Commerce Autosuggest

Submitted May 30, 2020

80 Million products across 80+ categories is what Flipkart’s Search enables discovery for. And, in a user’s journey of discovering products, she is shown with autosuggest suggestions to choose from while typing a query. These suggestions don’t just help users in choosing a well formed query with minimal typing effort, there is more to it.

This talk briefly touches upon the opportunities that decorating these suggestions brings to us.
After setting the context of how product popularity has led to a never ending loop in the system leading to this bias, I’ll be walking the audience through our journey of solving the problem of less sought categories not visible on autosuggest due to it.

We’ll start discussing our journey with an implementation that randomly chooses store decorations and the unexpected learnings that it gave us. Further ahead, we’ll look at the possible rewards that are relevant to autosuggest and the observations from our first reward based decoration selection algorithm which pretty much solves for the bias but misses to make its mark on the constraints that the problem poses. We’ll look at how looking at rewards as distributions gave further improvement but affected our metrics for quite some time initially. Introducing priors helped us with reducing the initial adjustment period and also showed interesting patterns around the impact of priors on overall convergence. We’ll close the discussion with the learnings at each step in our journey and the future work.

Outline

Problem Background

  1. Autosuggest in search
  2. Role of decoration as a two-way communication channel with the user
  3. What is the bias that we are trying to solve for and why is it there in the first place (with illustrations)
  4. Problem Definition : Goals & Constraints
  5. Issues with the existing reward (continuing the same illustration)

Journey of solving for it

  1. Explore Exploit as a solution

  2. First step towards solving : Random Selection

  • User Experience View
  • Observations
  • Merits of starting with random exploration
  1. Moving towards performance reward based exploration
  • Choice of reward and its pros and cons
  • Our way of implementing a performance based exploration algorithm
  • Convergence Illustrations
  • Movement in overall store visibility landscape
  • There was still scope for improvement, so what next?
  1. Need to account for regret along with reward
  • Visualising store decorations as Beta distributions. sampling on them for decoration selection
  • Convergence improvement
  • Movement in overall store visibility landscape
  • Observations : Slower convergence
  1. Solve for faster convergence
  • How priors affect movement in arms and the gains observed from having priors in sampling
  1. Future Work

Note : All sections will include illustrations, metrics movement and changing suggestions for the aforementioned example

Speaker bio

Pranjal is a Software Development Engineer with Flipkart Search

  • www.linkedin.com/in/pranjalsanjanwala

Slides

https://docs.google.com/presentation/d/1Ek27iIBoFB23-h_YVpkYpd50qaKx-Lb4PIhISeHRJPg/edit?usp=sharing

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more