Hack Five For members

The Fifth Elephant Open Source AI Hackathon 2024

GenAI makers and creators contest and showcase

Make a submission

Accepting submissions till 15 Feb 2024, 11:00 PM

Hasura, Bangalore

Tickets

Loading…

Overview

The Fifth Elephant Open Source AI Hackathon started on 5 January 2024 and reached its finale with a Demo Day event on 12 April 2024, when the winners of the two month long contest were chosen.

The aim of this hackathon was to encourage individuals/teams to apply and incubate innovative AI ideas/use cases and publish them as open source projects.

  • The hackathon contest participants worked with mentors for over two months to refine their ideas, and advance them to a stage where they are viable projects that could be pursued beyond the hackathon.
  • the project teams worked on AI’s application in education, accessibility, creative expression, scientific research, languages, under the overall theme of AI for India.
  • competing projects were judged on impact and relevance, innovation and creativity, technical soundness and code quality, scope of expansion, reusability and ease of adoption.

📋 Read Hack5 V1 Report

As a campaign to raise awareness and drive up developer adoption of AI and open source technologies, the hackathon was a great success. It helped shine light on the agility that open source technology enables for creative and innovative developers.

Open Source AI Hackathon Winners

Testimonials

“...each one of the contestants put in tremendous effort. And we saw the passion in every person, trying to do things not for winning, but about really building your projects. After a long time, I am attending such a hackathon where young folks are so passionate about building. Kudos to all of you”.
- Rukma Talwadker, Jury Member, Senior Principal Scientist at Games 24x7

“I really enjoyed judging all the projects - lot of interesting work. The Fifth Elephant has done a great job with mentoring and curating this hackathon”.
- Tanuja Ganu, Jury Member, Principal RSDE Manager, Microsoft India

“The hallmark of this hackathon was getting younger people to code for a longer period of time as opposed to a typical hackathon which turns out to be about — how do you build the coolest thing in the shortest period of time”.
- Sumod Mohan, mentor.

“What is impressive about this particular hackathon is, it is not just about cool ideas and fancy demos. It is actually about building a product or a software or a model that can live beyond the demo (and contest).”
- Soma Dhavala, team member at Project Seshu

“It was only through putting my ideas to code that I learnt what the specificity of implementing these (LLMs) were. I began my journey with a sense of hope and commitment towards FOSS principles, and the Hackathon only reinforced my belief that collaboration maketh a better product.”
- Sankalp Srivastava, Creator of Project Schematise

Key highlights from the hackathon

During the course of 12 weeks, the hackathon involved:

  1. Started off on 5 January 2024 and invited open source ideas and projects.
  2. Mentorship sessions in February for all project teams. Mentors included Abhishek H Mishra aka Tokenbender, Arvind Saraf, Bharat Shetty, Ramesh Hariharan, Sidharth Ramachandran, Simrat Hanspal, Sumod Mohan and Vinayak Hegde.
  3. The 10 best from 40 applications were chosen for the Demo Showcase.
  4. An involved peer-review process helped further refine projects between March 1st - 15th, followed by extensive rehearsals from April 8th - 10th, 2024.
  5. On Demo Showcase Day - we had project demos from 10 qualifying teams; 5 project winners were chosen on 12 April 2024.

The Prizes

🏆 Five prizes of ₹1,00,000 (One lakh rupees) per theme, were awarded to winning projects.
The prizes for this hackathon have been sponsored by Meta.

Note: Apart from the contest prizes, Microsoft has offered internships to the contestants.

Jury

  1. Ashok Hariharan heads data and business intelligence at United Nations Volunteers.
  2. Rukma Talwadker is Senior principal scientist at Games24x7.
  3. Shubha Shedthikere is a Senior Manager in the Data Science team at Swiggy.
  4. Sunil Abraham is the Public Policy Director for Data Economy and Emerging Tech at Meta, India.
  5. Tanuja Ganu is a Principal RSDE Manager at Microsoft Research India.

Mentors

  1. Abhishek Mishra is a is creator of CodeCherryPop LLM series.
  2. Arvind Saraf is a computer scientist, engineering leader, entrepreneur trained at IIT, MIT and Google.
  3. Simrat Hanspal is currently spearheading AI product strategy at Hasura.
  4. Sumod Mohan is the co-founder and CEO of AutoInfer.

Editors

About The Fifth Elephant

The Fifth Elephant is a community of practitioners, who share feedback on data, AI and ML practices in the industry. If you like the work that The Fifth Elephant does and want to support its activities - review of Papers, Books, building the innovation ecosystem in India through hackathons and conferences - contribute by picking up a membership.

Contact

💬 Post a comment with your questions here, or join The Fifth Elephant Telegram group and the WhatsApp group.

Follow @fifthel on Twitter.

📞 For any inquiries, call The Fifth Elephant at +91-7676332020.

sponsor image

Hosted by

The Fifth Elephant hackathons

Supported by

Host

All about data science and machine learning

Venue host

Welcome to the events page for events hosted at The Terrace @ Hasura. more

Partner

Providing all founders, at any stage, with free resources to build a successful startup.

Akshobhya

@akshobhya_j

Bharat Shetty Barkur

@ctangent Editor

Resources for Open-Source AI Hackathon

Submitted Jan 29, 2024

This submission contains a list of knowledge resources shared by the curator of The Fifth Elephant Open-Source AI hackathon, Bharat Shetty Barkur.

You will find interesting links here to spark creativity in your projects and aid in your idea development. If you have any interesting links to share, do add them as comments in this submission.

This submission will be updated daily until the hack day on 3 February. Do check back occasionally for updated resources.

  1. For those who are looking to leverage Microsoft’s semantic kernel along with LLMs: https://towardsdatascience.com/a-pythonistas-intro-to-semantic-kernel-af5a1a39564d

  2. https://github.com/pandeyanuradha/Chatbot-for-mental-health Folks, keep checking out stuff like Kaggle and GitHub for any interesting datasets that can help you in several use-cases like this one. This one uses a Kaggle dataset (less number of data 98 FAQs) But a RAG + LLM approach can also be tried out here etc.

Make sure that your hackathon descriptions detail out the nuances like this. It will help you to break down the main project to actionable small sets of items in the longer run.

  1. I had worked on a course project during my college time on generating molecules for drug discovery. Had referred to this paper then https://arxiv.org/pdf/2001.08184.pdf . Table 3 in the paper mentions some references for datasets. Not sure if your idea is slightly different but maybe the datasets are worth a look

  2. https://cloud.google.com/blog/topics/healthcare-life-sciences/building-a-clinical-intelligence-engine-using-medlm

For those who are looking for datasets in the healthcare mimic dataset an example dataset of open EHR. This Blog can give ideas that can be executed in open manner

  1. Saraswati has shared some datasets that might help for hackathon.
    https://cloud.google.com/healthcare-api/docs/resources/public-datasets
    https://www.kaggle.com/datasets/uciml/breast-cancer-wisconsin-data
    https://data.gov.in/keywords/Diagnostic

  2. https://arxiv.org/pdf/2303.18223.pdf check out this for survey of LLMs

  3. Also, check out this dataset that I found - https://huggingface.co/datasets/knowrohit07/know_medical_dialogues for anyone who wants to finetune and then try some nifty use-cases on top of this.

  4. @Saraswati Chandra has suggested these ideas for the hackathon
    Assuming this is in India context, here are some ideas for health related projects:
    ML
    Currently are predictively good at intent and pattern recognition
    Focus Area - Improving technicians efficiency OR resource deployment (public health)
    Potential Use cases
    a. Nurse hiring and deployment for villages (PH problem)
    b. ASHA workers route map plan
    c. Infectious disease (Malaria) potential harm map
    d. Breast cancer screening
    LLM
    Currently are predictively good at user interaction
    Focus area — Customer Care Journey OR customer Info (Public Health)
    Potential Use Cases

  • Vaccine Information & Reminders
  • Maternal care journey interaction
  • Post-Cancer Care
  • Pre&Post Clinic Visit interaction mgmt
  1. https://github.com/langroid/Awesome-LLM?tab=readme-ov-file#open-llm Folks, since many folks were asking what are the open src llms, take a look here for some of them
    10.https://arvindsaraf.medium.com/technology-for-impact-9b1c2c2c2934

    https://arvindsaraf.medium.com/regulating-ai-1aa732d8f82e
    Between them, one can pull up some ideas.

  2. Quality set of articles and notebooks on LLMs - https://github.com/ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing Please go through these resources - some of them will be helpful for sure.

  3. An example of quick hacks that folks are doing with LLMs and prompt engineering - https://arxiv.org/pdf/2401.14447.pdf

  4. https://github.com/poloclub/wordflow/ check out this to see how nicely they orchestrated the ideas/code/roadmap etc.

  5. https://lilacai-lilac.hf.space/datasets#lilac/OpenHermes-2.5&rowId="0000215f-9b07-46da-a8fa-b23aa28f1ba3" since datasets are useful for LLM pre-training/finetuning open source projects, this is a good example of app that will understand structure of these datasets.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Make a submission

Accepting submissions till 15 Feb 2024, 11:00 PM

Hasura, Bangalore

Hosted by

The Fifth Elephant hackathons

Supported by

Host

All about data science and machine learning

Venue host

Welcome to the events page for events hosted at The Terrace @ Hasura. more

Partner

Providing all founders, at any stage, with free resources to build a successful startup.