Efficient Machine Translation for low resource languages using Transformers

Sat, 23 Nov 2019, 08:30 AM – 05:30 PM IST

Taj M G Road, Bangalore


Submitted Nov 5, 2019

Session type: Full talk of 40 mins

The Transformer is the first transduction model to rely entirely on self-attention to compute representations of its input and output, without using sequence-aligned RNNs or convolution. OpenAI has used Transformers in its language models, and DeepMind used them in AlphaStar, its program that defeated a top professional StarCraft player.
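To make the idea of self-attention concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is not the code from the talk: the function names (`softmax`, `matmul`, `self_attention`), the toy input `tokens`, and the identity projection matrices are all assumptions chosen for illustration, and a real model would use learned weights and a tensor library.

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def matmul(a, b):
    # Plain-Python matrix product of two lists of rows.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def self_attention(x, w_q, w_k, w_v):
    # Project the same input into queries, keys, and values.
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    v = matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    # Similarity of every position with every other position.
    scores = matmul(q, k_t)
    # Scale by sqrt(d_k), then normalise each row into attention weights.
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    # Each output row is a weighted average of the value vectors.
    return matmul(weights, v), weights

# Hypothetical toy input: three token vectors of dimension 2.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # stand-in for learned projections
output, weights = self_attention(tokens, identity, identity, identity)
```

Each row of `weights` sums to 1 and says how much one position attends to every other position, which is what lets the model relate distant words without recurrence.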

Key Takeaways

Build a translation system for language pairs where only a small parallel sentence corpus is available, and still obtain relatively high BLEU scores.
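Since BLEU is the yardstick here, a simplified sketch of how it is computed may help. The version below scores one candidate against a single reference using clipped n-gram precision and a brevity penalty. The function names and the example sentences are assumptions for illustration. Standard BLEU is computed at corpus level, usually with smoothing, so an established implementation is the better choice for reporting results.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    # Count every contiguous n-gram in the token list.
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def sentence_bleu(candidate, reference, max_n=4):
    # Simplified, unsmoothed BLEU against a single reference.
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        ref = ngrams(reference, n)
        # Clip each n-gram count by its count in the reference.
        clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
        total = sum(cand.values())
        if total == 0 or clipped == 0:
            return 0.0  # no smoothing: any zero precision gives zero
        log_precisions.append(math.log(clipped / total))
    c, r = len(candidate), len(reference)
    # Penalise candidates that are shorter than the reference.
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    # Geometric mean of the n-gram precisions, times the penalty.
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)

reference = "the cat sat on the mat".split()
perfect = sentence_bleu(reference, reference)
repeated = sentence_bleu("the the the the".split(), reference, max_n=1)
```

Clipping is what stops a degenerate output such as "the the the the" from scoring well: only two of its four unigrams are credited, and the brevity penalty lowers the score further.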

Outline

Section 1.

Transformer Model Architecture
a. Encoder [Theory + Code]
b. Decoder [Theory + Code]
Self-Attention [Theory + Code]
Multi-head Attention [Theory + Code]
Positional Encoding [Theory + Code]
Note on BLEU Score
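As an illustration of the positional encoding item above, here is a minimal sketch of the sinusoidal scheme from the original Transformer paper. Whether the talk uses this exact scheme is an assumption, and the function name and the sizes in the example are hypothetical.

```python
import math

def positional_encoding(max_len, d_model):
    # Returns a max_len x d_model table of sinusoidal position encodings.
    table = []
    for pos in range(max_len):
        row = []
        for j in range(d_model):
            # Dimensions 2i and 2i+1 share the same frequency.
            angle = pos / (10000 ** ((2 * (j // 2)) / d_model))
            # Even dimensions use sine, odd dimensions use cosine.
            row.append(math.sin(angle) if j % 2 == 0 else math.cos(angle))
        table.append(row)
    return table

# Hypothetical sizes: 4 positions, model dimension 6.
pe = positional_encoding(4, 6)
```

Because self-attention itself ignores word order, these vectors are added to the token embeddings so that the model can tell positions apart.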

Section 2.

Solving a Real-World Translation Problem with Low-Resource Data
Attention Visualization
Translation Results

Requirements

Basic familiarity with neural networks and linear algebra.

Speaker bio

I have more than 3 years of industry experience in data science. I currently work as a data scientist (NLP) at niki.ai, where I have built models for Parse Classification, Unsupervised Synonym Detection, Identifying Code Mixing in text, and more. I have also participated in numerous data science competitions on Kaggle, AnalyticsVidhya, Topcoder, Crowdanalytix, and other platforms, and finished in the top 10 in at least a dozen of them.

Specialties: data science, machine learning, predictive modelling, natural language processing, deep learning, big data, artificial intelligence.

StackOverflow
Linkedin

Slides

https://www.beautiful.ai/player/-LswkeyEBAgBPzM7Kn_R


Hosted by

Anthill Inside

Anthill Inside is a forum for conversations about risk mitigation and governance in Artificial Intelligence and Deep Learning. AI developers, researchers, startup founders, ethicists, and AI enthusiasts are encouraged to: …

Anthill Inside 2019
