The Fifth Elephant Open Source AI Hackathon 2024

GenAI makers and creators contest and showcase

Accepting submissions till 15 Feb 2024, 11:00 PM

Hasura, Bangalore

Overview

The Fifth Elephant Open Source AI Hackathon started on 5 January 2024 and reached its finale with a Demo Day event on 12 April 2024, when the winners of the two-month-long contest were chosen.

The aim of this hackathon was to encourage individuals/teams to apply and incubate innovative AI ideas/use cases and publish them as open source projects.

  • The hackathon contest participants worked with mentors for over two months to refine their ideas and advance them to a stage where they were viable projects that could be pursued beyond the hackathon.
  • The project teams worked on AI’s application in education, accessibility, creative expression, scientific research, and languages, under the overall theme of AI for India.
  • Competing projects were judged on impact and relevance, innovation and creativity, technical soundness and code quality, scope of expansion, and reusability and ease of adoption.

As a campaign to raise awareness and drive up developer adoption of AI and open source technologies, the hackathon was a great success. It helped shine light on the agility that open source technology enables for creative and innovative developers.

Demo day finale

Testimonials

“...each one of the contestants put in tremendous effort. And we saw the passion in every person, trying to do things not for winning, but about really building your projects. After a long time, I am attending such a hackathon where young folks are so passionate about building. Kudos to all of you”.
- Rukma Talwadker, Jury Member, Senior Principal Scientist at Games 24x7

“I really enjoyed judging all the projects - lot of interesting work. The Fifth Elephant has done a great job with mentoring and curating this hackathon”.
- Tanuja Ganu, Jury Member, Principal RSDE Manager, Microsoft India

“The hallmark of this hackathon was getting younger people to code for a longer period of time as opposed to a typical hackathon which turns out to be about — how do you build the coolest thing in the shortest period of time”.
- Sumod Mohan, mentor.

“What is impressive about this particular hackathon is, it is not just about cool ideas and fancy demos. It is actually about building a product or a software or a model that can live beyond the demo (and contest).”
- Soma Dhavala, team member at Project Seshu

“It was only through putting my ideas to code that I learnt what the specificity of implementing these (LLMs) were. I began my journey with a sense of hope and commitment towards FOSS principles, and the Hackathon only reinforced my belief that collaboration maketh a better product.”
- Sankalp Srivastava, Creator of Project Schematise

Key highlights from the hackathon

During the course of 12 weeks, the hackathon involved:

  1. A launch on 5 January 2024, with an open call for open source ideas and projects.
  2. Mentorship sessions in February for all project teams. Mentors included Abhishek H Mishra aka Tokenbender, Arvind Saraf, Bharat Shetty, Ramesh Hariharan, Sidharth Ramachandran, Simrat Hanspal, Sumod Mohan and Vinayak Hegde.
  3. The 10 best from 40 applications were chosen for the Demo Showcase.
  4. An involved peer-review process helped further refine projects between March 1st - 15th, followed by extensive rehearsals from April 8th - 10th, 2024.
  5. Demo Showcase Day on 12 April 2024, with project demos from the 10 qualifying teams, after which 5 winning projects were chosen.

The Prizes

🏆 Five prizes of ₹1,00,000 (one lakh rupees) each, one per theme, were awarded to winning projects.
The prizes for this hackathon have been sponsored by Meta.

Note: Apart from the contest prizes, Microsoft has offered internships to the contestants.

Jury

  1. Ashok Hariharan heads data and business intelligence at United Nations Volunteers.
  2. Rukma Talwadker is a Senior Principal Scientist at Games24x7.
  3. Shubha Shedthikere is a Senior Manager in the Data Science team at Swiggy.
  4. Sunil Abraham is the Public Policy Director for Data Economy and Emerging Tech at Meta, India.
  5. Tanuja Ganu is a Principal RSDE Manager at Microsoft Research India.

Mentors

  1. Abhishek Mishra is the creator of the CodeCherryPop LLM series.
  2. Arvind Saraf is a computer scientist, engineering leader, entrepreneur trained at IIT, MIT and Google.
  3. Simrat Hanspal is currently spearheading AI product strategy at Hasura.
  4. Sumod Mohan is the co-founder and CEO of AutoInfer.

Editors

About The Fifth Elephant

The Fifth Elephant is a community of practitioners who share feedback on data, AI and ML practices in the industry. If you like the work that The Fifth Elephant does and want to support its activities (reviews of papers and books, and building the innovation ecosystem in India through hackathons and conferences), contribute by picking up a membership.

Contact

💬 Post a comment with your questions here, or join The Fifth Elephant Telegram group and the WhatsApp group.

Follow @fifthel on Twitter.

📞 For any inquiries, call The Fifth Elephant at +91-7676332020.

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conferences in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices.

Supported by

Venue host

Welcome to the events page for events hosted at The Terrace @ Hasura.

Partner

Microsoft for Startups Founders Hub is a digital ecosystem removing barriers to building a company with free access to technology, coaching, and support for founders in any stage of development. Let us accelerate your startup journey from idea-to-exit. Find out more here: https://startups.microsoft…

Adithya S K

@Adithya_S_K

Srinidhi Somayaji P

@Srinidhi9113

Achala Nayak

@achalanayak

A versatile open-source framework designed to adapt pre-trained Large Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.

Submitted Feb 15, 2024

Problem Statement:
Training Large Language Models (LLMs) for Indic languages from scratch is costly and impractical. In response, we present a streamlined framework for adapting pre-trained LLMs like Llama and Mixtral 8x7B to various languages, using a compact dataset for cross-lingual tasks. Our solution includes fine-tuning and evaluation processes tailored for practical production use cases.

Unique Selling Points (USPs):

  • Mixture of Languages Architecture: Introducing a novel architecture inspired by the “Mixture of Experts” framework in Mixtral 8x7B. Our model consists of five 7B-parameter models, each serving as an expert in a specific language (Kannada, Telugu, Tamil, Hindi, and English).

  • High-Quality Synthetic Data: The model is trained on high-quality synthetic data, ensuring efficiency and reducing additional training costs.

  • Adaptive LoRA Adapter Swapping: Employing a method to dynamically switch LoRA adapters during inference, enabling a single model to excel in various tasks such as RAG answering, translation, and instruction following.

  • Multilingual Support: The model is designed to be multilingual, proficient in five languages, catering to diverse linguistic requirements.

  • Indic LLM Evaluation Framework: Developed a specialized evaluation framework tailored for assessing the performance of Indic Large Language Models.
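
The submission does not say how a request reaches the right language expert. As a minimal sketch, assuming routing is done by detecting the script of the input text, the idea could look like the following. The Unicode block ranges are standard, but the function names and the `expert-…` labels are hypothetical placeholders, not part of the project.

```python
# Hypothetical routing sketch: pick a language expert from the script of the input.
# Assumption: script detection is enough to choose an expert. Devanagari is
# treated as Hindi here, although other languages share that script.
SCRIPT_RANGES = {
    "hindi": (0x0900, 0x097F),    # Devanagari block
    "tamil": (0x0B80, 0x0BFF),    # Tamil block
    "telugu": (0x0C00, 0x0C7F),   # Telugu block
    "kannada": (0x0C80, 0x0CFF),  # Kannada block
}


def detect_language(text: str) -> str:
    """Return the language whose script covers the most characters, else English."""
    counts = {lang: 0 for lang in SCRIPT_RANGES}
    for ch in text:
        code = ord(ch)
        for lang, (low, high) in SCRIPT_RANGES.items():
            if low <= code <= high:
                counts[lang] += 1
                break
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else "english"


def route(text: str) -> str:
    """Map a prompt to the placeholder name of the expert that should handle it."""
    return f"expert-{detect_language(text)}"
```

A production router would likely need a proper language identifier, since romanised Indic text and mixed-script prompts would all fall through to English under this heuristic.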

Model Architecture:
The proposed architecture draws inspiration from the Mixture of Experts framework, where each expert is bilingually trained in a specific language. This approach significantly reduces inference time, making it conducive to production environments. The dynamic switching of LoRA adapters during inference is based on specific use cases, ensuring adaptability for tasks like retail support conversations and translation. Note that the training of other models is currently in progress.
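
To make the adapter-swapping idea concrete, here is a toy sketch in plain Python. It is not the project's code: it assumes the standard LoRA formulation, in which a frozen weight matrix W is augmented by a low-rank update scaled by alpha / r, and every class and adapter name below is hypothetical. A real system would use a fine-tuning library rather than hand-written matrix code.

```python
from typing import Dict, List, Optional

Matrix = List[List[float]]
Vector = List[float]


def matvec(m: Matrix, v: Vector) -> Vector:
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


class LoRAAdapter:
    """Low-rank update: delta(x) = (alpha / r) * B @ (A @ x)."""

    def __init__(self, a: Matrix, b: Matrix, alpha: float) -> None:
        self.a = a                   # shape (r, d_in)
        self.b = b                   # shape (d_out, r)
        self.scale = alpha / len(a)  # alpha divided by the rank r


class LoRALinear:
    """A frozen linear layer with named adapters that can be swapped per request."""

    def __init__(self, weight: Matrix) -> None:
        self.weight = weight  # frozen base weights, shared by all tasks
        self.adapters: Dict[str, LoRAAdapter] = {}
        self.active: Optional[str] = None

    def add_adapter(self, name: str, adapter: LoRAAdapter) -> None:
        self.adapters[name] = adapter

    def set_adapter(self, name: Optional[str]) -> None:
        """Select the adapter for the next calls; None means base model only."""
        if name is not None and name not in self.adapters:
            raise KeyError(f"unknown adapter: {name}")
        self.active = name

    def forward(self, x: Vector) -> Vector:
        y = matvec(self.weight, x)
        if self.active is None:
            return y
        adapter = self.adapters[self.active]
        delta = matvec(adapter.b, matvec(adapter.a, x))
        return [yi + adapter.scale * di for yi, di in zip(y, delta)]


# Usage: one set of base weights, two task adapters, switched at inference time.
layer = LoRALinear([[1.0, 0.0], [0.0, 1.0]])
layer.add_adapter("translation", LoRAAdapter([[1.0, 0.0]], [[0.0], [1.0]], alpha=2.0))
layer.add_adapter("rag", LoRAAdapter([[0.0, 1.0]], [[1.0], [0.0]], alpha=1.0))

layer.set_adapter("translation")
translation_out = layer.forward([1.0, 2.0])
layer.set_adapter("rag")
rag_out = layer.forward([1.0, 2.0])
```

Because the base weights are never modified, switching tasks only changes which small pair of matrices is applied, which is what makes serving several tasks from one loaded model practical.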

Project Goals:

  1. User-Friendly Interface: Develop a straightforward interface to empower individuals in adapting models to different domains and languages. The inclusion of a graphical user interface (GUI) ensures easy accessibility, making the adaptation process user-friendly.

  2. Cutting-Edge Support: Incorporate the latest advancements in distributed training code, dataset generation, translation code, and all necessary components for seamless adaptation, fine-tuning, evaluation, and deployment of models. This ensures that the framework stays at the forefront of technology, providing users with state-of-the-art tools for their language model adaptation needs.
