Making Deep Neural Networks smaller and faster

Jul 2016

27 Mon

28 Tue

29 Wed

30 Thu

1 Fri 08:45 AM – 06:15 PM IST

2 Sat 08:15 AM – 02:15 PM IST

3 Sun

CMRIT College

All submissions

Previous Next

This submission has been added to the schedule

Making Deep Neural Networks smaller and faster

Submitted May 31, 2016

Section: Crisp talk Technical level: Intermediate

Deep neural networks with millions of parameters are at the heart of many state of the art machine learning models today. However, is has been shown that models with much smaller number of parameters can also perform just as well. A smaller model has the advantage of being faster to evaluate and easier to store - both of which are crucial for real-time and embedded / mobile applications. In this talk, I intend to provide a brief overview of such model compression techniques. Using these techniques, it is possible to compress neural networks by as much as 10x and speed up inference by 3-4x.

Outline

First, I shall motivate the general problem of model compression and it’s relevance for real-world applications.
Then, I shall provide overviews of the following papers:

Learning both Weights and Connections for Efficient Neural Networks, NIPS 2015
Deep Compression, ICLR 2016
Learning the Architecture of Deep Neural Networks, arxiv 2016

Requirements

Familiarity with Convolutional Neural Networks

Speaker bio

Suraj is a second year Master’s student at Video Analytics Lab, Indian Institute of Science. From the past one and a half years, he has been working on the problem of model compression. His work has been presented previously at British Machine Vision Conference (BMVC) - 2015.

Comments

Jul 2016

27 Mon

28 Tue

29 Wed

30 Thu

1 Fri 08:45 AM – 06:15 PM IST

2 Sat 08:15 AM – 02:15 PM IST

3 Sun

Hosted by

The Fifth Elephant

Jump starting better data engineering and AI futures

Deep Learning Conf 2016

Making Deep Neural Networks smaller and faster

Outline

Requirements

Speaker bio

Links

Comments