Attention based sequence to sequence models for natural language processing

23 Nov 2019 (Sat), 08:30 AM – 05:30 PM IST


Accepting submissions till 01 Nov 2019, 04:20 PM

Taj M G Road, Bangalore, Bangalore


## About the 2019 edition:

The schedule for the 2019 edition is published here: https://hasgeek.com/anthillinside/2019/schedule

The conference has three tracks:

Talks in the main conference hall track
Poster sessions featuring novel ideas and projects in the poster session track
Birds of Feather (BOF) sessions for practitioners who want to use the Anthill Inside forum to discuss:

Myths and realities of labelling datasets for Deep Learning.
Practical experience with using Knowledge Graphs for different use cases.
Interpretability and its application in different contexts; challenges with GDPR and interpreting datasets.
Pros and cons of using custom and open source tooling for AI/DL/ML.

# Who should attend Anthill Inside:

Anthill Inside is a platform for:

Data scientists
AI, DL and ML engineers
Cloud providers
Companies which make tooling for AI, ML and Deep Learning
Companies working with NLP and Computer Vision who want to share their work and learnings with the community

For inquiries about tickets and sponsorships, call Anthill Inside on 7676332020 or write to sales@hasgeek.com

# Sponsors:

Sponsorship slots for Anthill Inside 2019 are open.


Hosted by

Anthill Inside

Anthill Inside is a forum for conversations about risk mitigation and governance in Artificial Intelligence and Deep Learning. It welcomes AI developers, researchers, startup founders, ethicists, and AI enthusiasts.


Attention based sequence to sequence models for natural language processing

Submitted Apr 26, 2019

Section: Workshops
Technical level: Intermediate
Session type: Workshop

## Workshop details, including schedule, venue, date and tickets, are published here: https://hasgeek.com/anthillinside/sequence-to-sequence-models-workshop/

Ilya Sutskever and others introduced sequence to sequence learning with neural networks. Subsequently, Bahdanau and others introduced “attention”, which resembles the human ability to focus closely on one part of an input, to improve the performance of sequence to sequence models in machine translation. Later, Vaswani and others introduced the Transformer model, which is built entirely on the idea of “self-attention”. These ideas have proved very useful in practice for building powerful natural language processing models (https://ai.googleblog.com/2016/09/a-neural-network-for-machine.html). In this hands-on workshop using PyTorch, we will learn to build natural language processing models using these concepts.
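As a rough illustration of the attention idea described above, the sketch below computes attention for a single decoder step in plain Python, with no dependencies. It is not taken from the workshop material: it uses simple dot-product scoring (Bahdanau and others used a small learned scoring network instead), and the function names and toy vectors are hypothetical.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, encoder_states):
    """Dot-product attention for one decoder step (illustrative only).

    query: the current decoder state, a list of floats
    encoder_states: one vector (list of floats) per source token
    Returns (weights, context), where context is the weighted average
    of the encoder states.
    """
    # Score each source position by its similarity to the query.
    scores = [sum(q * h for q, h in zip(query, state)) for state in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    # Blend the encoder states according to the attention weights.
    context = [
        sum(w * state[i] for w, state in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context

# Hypothetical toy data: three source tokens with 2-dimensional states.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]  # scores highest against the first and third states
weights, context = attend(query, encoder_states)
```

The weights show where the decoder "looks": source positions whose states align with the query get most of the weight, and the context vector passed on to the decoder is dominated by those positions.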

Outline

Introduction to sequence models
Why sequence to sequence models? Build a sequence to sequence model on sample data.
What is attention? Enhance the model and understand the value of attention.
Transformer architecture: sequence to sequence modeling using self-attention
Build a transformer model on sample data
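To give a flavour of the self-attention item in the outline, here is a minimal PyTorch sketch of single-head scaled dot-product self-attention. It is an assumption-laden illustration rather than the workshop's code: the class name `SelfAttention` and the tensor sizes are hypothetical, and a full Transformer layer would add multiple heads, masking, positional information, residual connections and a feed-forward block.

```python
import math

import torch
import torch.nn as nn

class SelfAttention(nn.Module):
    """Single-head scaled dot-product self-attention (illustrative only)."""

    def __init__(self, d_model):
        super().__init__()
        self.d_model = d_model
        # Learned projections that produce queries, keys and values
        # from the same input sequence.
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)

    def forward(self, x):
        # x: (batch, seq_len, d_model)
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)
        # Similarity of every position with every other position,
        # scaled by sqrt(d_model): (batch, seq_len, seq_len)
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_model)
        weights = torch.softmax(scores, dim=-1)  # each row sums to 1
        # Each output position is a weighted average of all value vectors.
        return weights @ v, weights

torch.manual_seed(0)
x = torch.randn(2, 5, 16)  # hypothetical batch: 2 sequences of 5 tokens
layer = SelfAttention(d_model=16)
out, weights = layer(x)
```

Unlike encoder-decoder attention, the queries, keys and values here all come from the same sequence, so every token can draw on every other token in a single step.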

Requirements

Laptop

Speaker bio

Madhu Gopinathan is currently Vice President, Data Science at MakeMyTrip (MMT), India’s leading online travel company. At MakeMyTrip, he led the development of natural language processing models for Myra, MMT’s task bot for customer service (https://economictimes.indiatimes.com/jobs/rise-of-the-machines-when-bots-take-over-the-workplace/articleshow/66930068.cms).
Madhu holds a PhD in computer science from the Indian Institute of Science, on mathematical modelling of software systems, and an MS in computer science from the University of Florida, Gainesville, USA. He has collaborated with researchers at Microsoft Research, General Motors and the Indian Institute of Science, leading to publications in prominent computer science conferences.
He has extensive experience developing large-scale systems using machine learning and natural language processing, and has been granted multiple US patents.
