How GO-FOOD built a Query Semantics Engine to help you find food faster

NIMHANS Convention Centre, Bengaluru

##The eighth edition of The Fifth Elephant will be held in Bangalore on 25 and 26 July. A thousand data scientists, ML engineers, data engineers and analysts will gather at the NIMHANS Convention Centre to discuss:

Model management, including data cleaning, instrumentation and productionizing data science.
Bad data and case studies of failure in building data products.
Identifying and handling fraud, and data security at scale.
Applications of data science in agriculture, media and marketing, supply chain, geo-location, SaaS and e-commerce.
Feature engineering and ML platforms.
What it takes to create data-driven cultures in organizations of different scales.

##Highlights:

1. Meet Peter Wang, co-founder of Anaconda Inc, and learn about why data privacy is the first step towards robust data management; the journey of building Anaconda; and Anaconda in enterprise.
2. Talk to the Fulfillment and Supply Group (FSG) team from Flipkart, and learn about their work with platform engineering where ground truths are the source of data.
3. Attend tutorials on Deep Learning with RedisAI and on TransmogrifAI, Salesforce’s open source AutoML library.
4. Discuss interesting problems to solve with data science in agriculture, a SaaS perspective on multi-tenancy in Machine Learning (with the Freshworks team), and bias in intent classification and recommendations.
5. Meet data science, data engineering and product teams from sponsoring companies to understand how they are handling data and leveraging intelligence from data to solve interesting problems.

##Why should you attend?

Network with peers and practitioners from the data ecosystem
Share approaches to solving expensive problems such as cleanliness of training data, model management and versioning data
Demo your ideas in the demo session
Join Birds of a Feather (BOF) sessions to have productive discussions on focussed topics, or start your own BOF session.

##Full schedule published here: https://hasgeek.com/fifthelephant/2019/schedule

##Contact details:
For more information about The Fifth Elephant, sponsorships, or anything else, call +91-7676332020 or email info@hasgeek.com

Hosted by

The Fifth Elephant

The Fifth Elephant, known as one of the best data science and Machine Learning conferences in Asia, has transitioned into a year-round forum for conversations about data and ML engineering, data science in production, and data security and privacy practices.

This submission has been added to the schedule

How GO-FOOD built a Query Semantics Engine to help you find food faster

Submitted Apr 10, 2019

Session type: Full talk of 40 mins

Context: The Search problem

GOJEK is a SuperApp: 19+ apps within an umbrella app. One of these is GO-FOOD, the first food delivery service in Indonesia and the largest food delivery service in Southeast Asia. There are over 300 thousand restaurants on the platform with a total of over 16 million dishes between them.

Over two-thirds of those who order food online using GO-FOOD do so through text search. Improving ranking is an extremely important part of enhancing the search experience, but understanding the query itself is what lets us give searchers exactly what they’re looking for. The query and its semantic neighbours become the focus of the search process: after all, if I don’t understand what you’re trying to ask for, how will I give you what you want?

Query Understanding: What & Why

This is where Query Understanding comes into the picture: it’s about using NLP to correctly identify the search intent behind the query and return more relevant search results. GO-FOOD search runs on the ElasticSearch stack, which returns only exact text matches and/or fuzzy matches. We wanted to create a holistic search experience that not only personalised search results, but also retrieved restaurants and dishes that were more relevant to what the user was looking for.
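To make the lexical baseline concrete, here is a minimal sketch of an exact-style match and a fuzzy match, written as Python dictionaries in the ElasticSearch query DSL. The field name `dish_name` and both helper functions are hypothetical; the abstract does not describe GO-FOOD's actual index mapping or queries.

```python
def exact_match_query(text, field="dish_name"):
    """Lexical match: every analyzed query term must appear in the field."""
    return {"query": {"match": {field: {"query": text, "operator": "and"}}}}


def fuzzy_match_query(text, field="dish_name"):
    """Same match query, but each term may differ by a small edit distance."""
    return {"query": {"match": {field: {"query": text, "fuzziness": "AUTO"}}}}


if __name__ == "__main__":
    # "gorng" is a deliberate typo that fuzzy matching can tolerate.
    print(fuzzy_match_query("nasi gorng"))
```

Both queries compare characters and tokens only, so neither can relate two phrases that share no tokens, which is the gap a semantic approach is meant to close.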

In this talk, you will learn how we are taking advantage of word embeddings to build a Query Understanding Engine that is holistically designed to make the customer’s experience as smooth as possible. I will go over the techniques we used to build each component of the engine, the data and algorithmic challenges we faced, and how we solved each problem we came across.

Learning Objectives

The primary objective of the talk is for you to learn why deriving query semantics is essential to building a great search engine, and how you can go about building a Query Semantics Engine.

You will learn how to:

Take advantage of word embeddings for building an intelligent search engine
Deal with data challenges
Choose from various metrics when evaluating the performance of a search engine
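The abstract does not say which metrics were considered. As an illustration only, the sketch below implements two widely used ranking metrics, precision at k and mean reciprocal rank; the function names and the document IDs in the usage example are made up.

```python
def precision_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the top-k results that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_ids[:k]
    return sum(1 for doc_id in top_k if doc_id in relevant_ids) / k


def reciprocal_rank(ranked_ids, relevant_ids):
    """1 / rank of the first relevant result, or 0.0 if none is returned."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs):
    """Average reciprocal rank over (ranked_ids, relevant_ids) pairs."""
    if not runs:
        return 0.0
    return sum(reciprocal_rank(r, rel) for r, rel in runs) / len(runs)


if __name__ == "__main__":
    ranked = ["r7", "r2", "r9"]   # hypothetical restaurant IDs, best first
    relevant = {"r2", "r9"}       # hypothetical relevance judgements
    print(precision_at_k(ranked, relevant, k=2))
    print(reciprocal_rank(ranked, relevant))
```

Precision at k rewards filling the first screen with relevant results, while reciprocal rank only cares about how early the first relevant result appears, so the two can disagree about which ranking is better.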

I will walk you through the journey of how we chose the solution we have, and why it made the most sense in our context.

Outline

Defining the context for the search problem
Why we need a Query Semantics Engine and how it can add value
Existing workflow and what was proposed
Inside the Query Semantics Engine: what the components are and how they fit into the picture
Building the components: two of the most important components of the query understanding workflow are Intent Classification and Query Expansion. In this talk I will focus on Query Expansion using word embeddings, and on enhancing the search results with the help of Intent Classification. I will also talk about Spell Correction as a preprocessing step.
How we brought all the components together when building the ElasticSearch Query
Overview of what kind of results were surfaced to the end user
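As a rough illustration of two of the outline items, Query Expansion with word embeddings and assembling the ElasticSearch query, the sketch below finds a term's nearest neighbours by cosine similarity and folds them into a `bool` query with `should` clauses. It is not GO-FOOD's implementation: the three-dimensional vectors are invented toy values, and the field name `dish_name`, the boost values and the function names are all assumptions. Intent Classification and Spell Correction are left out.

```python
import math

# Toy vectors invented for illustration; real embeddings are learned from data.
TOY_EMBEDDINGS = {
    "burger": [0.9, 0.1, 0.0],
    "cheeseburger": [0.85, 0.2, 0.05],
    "fries": [0.6, 0.5, 0.1],
    "sushi": [0.0, 0.2, 0.95],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_query(term, embeddings, top_n=2, min_similarity=0.5):
    """Return up to top_n (word, similarity) neighbours of term, best first."""
    if term not in embeddings:
        return []
    scored = [
        (word, cosine(embeddings[term], vec))
        for word, vec in embeddings.items()
        if word != term
    ]
    scored = [(w, s) for w, s in scored if s >= min_similarity]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


def build_expanded_query(term, neighbours, field="dish_name"):
    """Combine the original term and its neighbours into one bool query."""
    # The original term gets a higher boost (2.0, an arbitrary choice).
    should = [{"match": {field: {"query": term, "boost": 2.0}}}]
    for word, similarity in neighbours:
        # Each neighbour is weighted by its similarity to the original term.
        should.append(
            {"match": {field: {"query": word, "boost": round(similarity, 3)}}}
        )
    return {"query": {"bool": {"should": should, "minimum_should_match": 1}}}


if __name__ == "__main__":
    neighbours = expand_query("burger", TOY_EMBEDDINGS)
    print(build_expanded_query("burger", neighbours))
```

Weighting each expanded term by its similarity keeps the user's original wording dominant, so expansion broadens recall without letting loosely related terms outrank direct matches.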

Requirements

An interest in the Search problem and a curiosity to find out what goes on behind the scenes. A basic understanding of the following would be useful:

What word embeddings are and how the vector representations work
Building ElasticSearch queries using the DSL

Speaker bio

Ishita has been working as a Data Scientist since 2016 with product-based startups, understanding business concerns in various domains and formulating them as technical problems that can be solved using data and ML. Her current work at GO-JEK involves end-to-end development of ML projects, working as part of a product team to define, prototype and implement data science models within the product. She has also published the book “Applied Supervised Learning with Python” with Packt.

Ishita completed her Master’s degree in High Performance Computing with Data Science at the University of Edinburgh, UK, and her Bachelor’s degree with Honours in Physics at St. Stephen’s College, Delhi.