The Fifth Elephant 2020 edition

The Fifth Elephant 2020 edition

On data governance, engineering for data privacy and data science

The ninth edition of The Fifth Elephant will be held in Bangalore on 16 and 17 July 2020.

The Fifth Elephant brings together over one thousand data scientists, ML engineers, data engineers and analysts to discuss:

  1. Data governance
  2. Data privacy and engineering for privacy including engineering for Personal Data Protection (PDP) bill.
  3. Data cleaning, annotation, instrumentation and productionizing data science.
  4. Identifying and handling fraud + data security at scale
  5. Feature engineering and ML platforms.
  6. What it takes to create data-driven cultures in organizations of different scales.

**Event details:

Dates: 16-17 July 2020
Venue: NIMHANS Convention Centre, Dairy Circle, Bangalore

Why you should attend:

  1. Network with peers and practitioners from the data ecosystem.
  2. Share approaches to solving expensive problems such as cleanliness of training data, annotation, model management and versioning data.
  3. Demo your ideas in the demo sessions.
  4. Join Birds of Feather (BOF) sessions to have productive discussions on focussed topics. Or, start your own Birds of Feather (BOF) session.

Contact details:
For more information about The Fifth Elephant, call +91-7676332020 or email sales@hasgeek.com


Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Vishal Gupta

@vizgupta

Predicting Deal Closure in a Sales CRM using Email Sentiment

Submitted May 31, 2020

Emails are the most common form of communication in a sale and can be used to actively determine the customer’s interest in purchasing a product/service. Statistically, deals with more email replies from the customer are more likely to win. Our project, Deal sentiment at Freshworks as a part of the Freshsales CRM involves predicting sentiment from customers’ and agents’ mails and using it to estimate the probability of the deal winning.

Outline

Cleaning and Parsing emails (Data cleaning)

  • Parsing HTML mails
  • Removing Signature from emails
  • Processing Zoom invites
  • Processing calendar invites
  • Converting emails to conversations

Annotating emails/conversations (Data annotation)

  • Sentiment tagging (-2 to +2) : Need for tagging conversations and why deal outcome cannot be used.
  • Intent tagging : One or more intents tagged from a pool of 50 intents picked by consulting salespersons

Instrumentation

  • Pipeline
    • Ingestion : Kafka consumer, followed by preprocessing and language prediction
    • Population and generation of conversations
    • Sentiment prediction : Get embedding and predict conversation sentiment
    • Use conversation sentiment scores and other features to predict deal sentiment
  • Multi-account models : Using clustering to pick models

Deploying Sentiment (Productionizing data science)

  • Scaling to multiple accounts
  • Clustering customer mails to create buckets

Speaker bio

Machine Learning Engineer, Freshworks

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more