Anthill Inside 2018

Anthill Inside 2018

On the current state of academic research, practice and development regarding Deep Learning and Artificial Intelligence.

##About the conference and topics for submitting talks:
In 2016, The Fifth Elephant branched into a separate conference on Deep Learning. The Deep Learning Conference has grown in to a large community under the brand Anthill Inside.

Anthill Inside features talks, panels and Off The Record (OTR) sessions on current research, technologies and developments around Artificial Intelligence (AI) and Deep Learning. Submit proposals for talks and workshops on the following topics:

  1. Theoretical concepts in Deep Learning, AI and Machine Learning – and how these have been applied in real life situations / specific domains. In 2017, we covered GANS, Reinforcement Learning and Transfer Learning. We seek speakers from academia who can communicate these concepts to an audience of practitioners.
  2. Latest tools, frameworks, libraries – either as short talks demonstrating these, or as full talks explaining why you chose the technology, including comparisons made and metrics used in evaluating the choice.
  3. Application of Computer Vision, NLP, speech recognition, video analytics and voice-to-speech in a specific domain or for building product. We are also interested in talks on application of Deep Learning to hardware and software problems / domains such as GPUs, self-driving cars, etc.
  4. Case studies of AI / Deep Learning and product: the journey of arriving at the product, not an elaboration of the product itself. We’d also like to understand why you chose AI, Deep Learning or Machine Learning for your use case.

##Perks for submitting proposals:
Submitting a proposal, especially with our process, is hard work. We appreciate your effort.
We offer one conference ticket at discounted price to each proposer, and a t-shirt.
We only accept one speaker per talk. This is non-negotiable. Workshops may have more than one instructor.
In case of proposals where more than one person has been mentioned as collaborator, we offer the discounted ticket and t-shirt only to the person with who the editorial team corresponded directly during the evaluation process.

##Target audience:
We invite beginner and advanced participants from:

  1. Academia,
  2. Industry and
  3. Startups,

to participate in Anthill Inside. At the 2018 edition, tracks will be curated separately for beginner and advanced audiences.

Developer evangelists from organizations which want developers to use their APIs and technologies for deep learning and AI should participate, speak and/or sponsor Anthill Inside.

Anthill Inside is a two-day conference with two tracks on each day. Track details will be announced with a draft schedule in February 2018.

We are accepting sessions with the following formats:

  1. Crisp (20 min) and full (40 min) talks.
  2. OTR sessions on focussed topics / questions. An OTR is 1 to 1.5 hours long and typically has four facilitators including or excluding one moderator.
  3. Workshops and tutorials of 3-6 hours duration on Machine Learning concepts and tools, full stack data engineering, and data science concepts and tools.
    4. Birds Of Feather (BOF) sessions, talks and workshops for open houses and pre-events in Bangalore and other cities between October 2017 and June 2018. We have events open round the year. Reach out to us on should you be interested in speaking and/or hosting a community event between now and the conference in July 2018.

##Selection criteria:
The first filter for a proposal is whether the technology or solution you are referring to is open source or not. The following criteria apply for closed source talks:

  1. If the technology or solution is proprietary, and you want to speak about your propritary solution to make a pitch to the audience, you should pick up sponsored session. This involves paying for the speaking slot. Write to
  2. If the technology or solution is in the process of being open sourced, we will consider the talk only if the solution is open sourced at least three months before the conference.
  3. If your solution is closed source, you should consider proposing a talk explaining why you built it in the first place; what options did you consider (business-wise and technology-wise) before making the decision to develop the solution; or, what is your specific use case that left you without existing options and necessitated creating the in-house solution.

The criteria for selecting proposals, in the order of importance, are:

  1. Key insight or takeaway: what can you share with participants that will help them in their work and in thinking about the ML, big data and data science problem space?
  2. Structure of the talk and flow of content: a detailed outline – either as mindmap or draft slides or textual decription – will help us understand the focus of the talk, and the clarity of your thought process.
  3. Ability to communicate succinctly, and how you engage with the audience. You must submit link to a two-minute preview video explaining what your talk is about, and what is the key takeaway for the audience.

No one submits the perfect proposal in the first instance. We therefore encourage you to:

  1. Submit your proposal early so that we have more time to iterate if the proposal has potential.
  2. Talk to us on our community Slack channel: if you want to discuss an idea for your proposal, and need help / advice on how to structure it.

Our editorial team helps potential speakers in honing their speaking skills, fine tuning and rehearsing content at least twice - before the main conference - and sharpening the focus of talks.

##How to submit a proposal (and increase your chances of getting selected):
The following guidelines will help you in submitting a proposal:

  1. Focus on why, not how. Explain to participants why you made a business or engineering decision, or why you chose a particular approach to solving your problem.
  2. The journey is more important than the solution you may want to explain. We are interested in the journey, not the outcome alone. Share as much detail as possible about how you solved the problem. Glossing over details does not help participants grasp real insights.
  3. Focus on what participants from other domains can learn/abstract from your journey / solution. Refer to these talks, from some of HasGeek’s other conferences, which participants liked most:
  4. We do not accept how-to talks unless they demonstrate latest technology. If you are demonstrating new tech, show enough to motivate participants to explore the technology later. Refer to talks such as this: to structure your proposal.
  5. Similarly, we don’t accept talks on topics that have already been covered in the previous editions. If you are unsure about whether your proposal falls in this category, drop an email to:
  6. Content that can be read off the internet does not interest us. Our participants are keen to listen to use cases and experience stories that will help them in their practice.

To summarize, we do not accept talks that gloss over details or try to deliver high-level knowledge without covering depth. Talks have to be backed with real insights and experiences for the content to be useful to participants.

##Passes and honorarium for speakers:
We pay an honararium of Rs. 3,000 to each speaker and workshop instructor at the end of their talk/workshop. Confirmed speakers and instructors also get a pass to the conference and networking dinner. We do not provide free passes for speakers’ colleagues and spouses.

##Travel grants for outstation speakers:
Travel grants are available for international and domestic speakers. We evaluate each case on its merits, giving preference to women, people of non-binary gender, and Africans. If you require a grant, request it when you submit your proposal in the field where you add your location. Anthill Inside is funded through ticket purchases and sponsorships; travel grant budgets vary.

##Last date for submitting proposals is: 15 April 2018.
You must submit the following details along with your proposal, or within 10 days of submission:

  1. Draft slides, mind map or a textual description detailing the structure and content of your talk.
  2. Link to a self-recorded, two-minute preview video, where you explain what your talk is about, and the key takeaways for participants. This preview video helps conference editors understand the lucidity of your thoughts and how invested you are in presenting insights beyond the solution you have built, or your use case. Please note that the preview video should be submitted irrespective of whether you have spoken at previous editions of Anthill Inside.
  3. If you submit a workshop proposal, you must specify the target audience for your workshop; duration; number of participants you can accommodate; pre-requisites for the workshop; link to GitHub repositories and a document showing the full workshop plan.

##Contact details:
For information about the conference, sponsorships and tickets contact or call 7676332020. For queries on talk submissions, write to

Hosted by

Anthill Inside is a forum for conversations about risk mitigation and governance in Artificial Intelligence and Deep Learning. AI developers, researchers, startup founders, ethicists, and AI enthusiasts are encouraged to: more

Kalpit Desai


The Catalog as a Catalyst - Bringing benefits of Big Data to MSMEs

Submitted Apr 12, 2018

While large enterprises have the necessary resources to acquire and process Big Data, the Micro / Small / Medium enterprises in emerging economies like India are far from being ‘data-driven’. This is a huge opportunity untapped, considering that MSMEs account for more than 99% of businesses, and they make up the backbone of our economy. For the opportunity to be leveraged, a crucial pre-requisite is that the MSMEs transactions be anchored onto a common ‘reference’ data. The ‘Product Catalog’ is one such reference data containing rich semantics about all products being transacted by MSMEs.

However, building and maintaining such catalogs especially for MSMEs is a herculean task in itself, owing to several complex challenges. First, the product universe for MSMEs is very diverse. Even at the top-level classification of finished goods alone, there are nearly hundred industry segments. Moreover, while B2C companies transact in finished goods, B2B transactions happen in raw material, intermediate artifacts, and parts all of which combine to make a consumer good. Hence, the size of the product catalog in which MSMEs operate, is perhaps hundreds of times larger than the size of the catalog operated by, let’s say the e-commerce sector. Second, there is a vast disorganization in terms of product representation. A pencil may be represented by a manufacturer as ‘Natraj Pencil hardness HB, shape=Octagone’; whereas the same pencil may be denoted by a retailer simply as ‘Pencils’. In addition, there is the issue of multilingual representations, given that India has more than 20 regional languages and even more local dialects. And often, the MSME owners / data operators aren’t familiar with English. Third, the catalog needs to cover the product universe transacted by a huge number of businesses of varying scale. By government census, there are around 6 million registered businesses in India, and 99% of them are MSMEs. Each business records and structures their data uniquely to suit their individual needs, and because no standardization has been enforced.

Keeping in mind the nature and scale of the problem, this talk will present innovative approaches to tackling a few of the challenges in building a product catalog for MSMEs. These solutions rely on techniques ranging from heuristics, string match to conditional random fields, evidence theory, and semantic graph mining, to name a few.


For the big data analytics to be leveraged by MSMEs, a crucial pre-requisite is that their transactions be anchored onto a common ‘reference’ data. The ‘Product Catalog’ is one such reference data containing rich semantics about all products being transacted by MSMEs. This talk will present innovative approaches to tackling a few of the challenges in building a product catalog for MSMEs.

Speaker bio

Kalpit V. Desai is the Director of Data Science at Clustr. Prior to Clustr, Kalpit has gained over 14 years of experience building the core algorithms for data products in variety of settings ranging from an academic lab CISMM to a multinational conglomerate GE to a start-up Bidgely. His core expertize is in building intelligent software systems based on statistical inference, pattern recognition and machine learning. He is passionate about making use of data and algorithms to make our world a better place. Kalpit holds PhD from The University of North Carolina at Chapel Hill, USA and has numerous patents and peer-reviewed publications at international journals in the field of data science. He leads a prize-winning team in the IEEE data mining contest ICMD 2011. When the clock is ticking a bit slower, Kalpit enjoys family time, chess, non-fiction, and often advising budding businesses on their data strategy.



{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

Anthill Inside is a forum for conversations about risk mitigation and governance in Artificial Intelligence and Deep Learning. AI developers, researchers, startup founders, ethicists, and AI enthusiasts are encouraged to: more