GAN-inspired Innovations in Computer Vision
Submitted by Pushkar Pushp (@ppushp7) on Tuesday, 30 April 2019
Technical level: Intermediate
Session type: Lecture
Section: Tutorials
“The most interesting idea in the last 10 years in ML.” - Yann LeCun, Facebook AI research director.
In this talk, we will focus on Generative Adversarial Networks (GANs), one of the most interesting concepts in deep learning. A GAN is a generative model: it captures the patterns in the data so that it can generate new data points from the estimated data distribution. In recent years, GANs have been an active research area, with applications including text-to-image synthesis, photorealistic image generation from doodles, and many more.
We will cover how GANs work, walk through an implementation, and look at some of these applications.
Keywords: StackGAN, DCGAN, Generators, Autoencoders, VAE
Generative vs Discriminative Models
Introduction to GAN
How do GANs work?
Generators and Discriminators
Cost function and optimization
GANs vs Autoencoders and VAE
Recent applications/case studies of GANs
Pose Guided Person Image Generation
NVIDIA’s GauGAN (doodles into photorealistic images)
We will also walk through an implementation of DCGAN in a Jupyter notebook using Keras, to make both the concept and the implementation concrete.
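As a rough illustration of the "Cost function and optimization" item in the outline, the sketch below computes the standard GAN losses from discriminator outputs. It is not the code from the talk: it assumes the original minimax formulation with the commonly used non-saturating generator loss, and the function names `discriminator_loss` and `generator_loss`, along with the `eps` parameter, are hypothetical choices made for this example.

```python
import math


def discriminator_loss(d_real, d_fake, eps=1e-12):
    """Binary cross-entropy for the discriminator.

    d_real: discriminator outputs (probabilities) on real samples, target 1.
    d_fake: discriminator outputs (probabilities) on generated samples, target 0.
    eps is a small constant (an assumption of this sketch) to avoid log(0).
    """
    real_term = sum(-math.log(p + eps) for p in d_real) / len(d_real)
    fake_term = sum(-math.log(1.0 - p + eps) for p in d_fake) / len(d_fake)
    return real_term + fake_term


def generator_loss(d_fake, eps=1e-12):
    """Non-saturating generator loss: the generator is rewarded when the
    discriminator assigns high probability to generated samples."""
    return sum(-math.log(p + eps) for p in d_fake) / len(d_fake)


# Example: a discriminator that is fairly confident on both real and fake samples.
d_real = [0.9, 0.8]
d_fake = [0.1, 0.2]
print("discriminator loss:", discriminator_loss(d_real, d_fake))
print("generator loss:", generator_loss(d_fake))
```

In training, the two losses are minimized in alternation: the discriminator is updated to lower its loss, then the generator is updated to lower its own, which pushes the discriminator's outputs on generated samples toward 1.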
A basic understanding of deep learning and of how neural networks are trained is expected. Beginner-level knowledge of Python and Keras will help in following the concepts.
Pushkar Pushp works as a Data Scientist at WalmartLabs. He completed his undergraduate and master's degrees in statistics at ISI, Kolkata. His interests range from pure mathematics and Python to computer vision and deep learning. He has worked extensively with Keras/TensorFlow to develop state-of-the-art models for tasks such as face recognition, trigger word detection, machine translation, and other sequence problems.
I have a Master’s degree in Information Technology with a Data Science major from IIIT Bangalore. Currently, I am working on Computer Vision as a Statistical Analyst at Walmart Labs India. I have worked in Machine Learning for the past 4 years, on projects involving object detection, GANs, CNNs, and recommendation systems. I also have a provisionally filed patent titled ‘System and method for produce detection and classification’ for an image classification algorithm.