Open Source AI Hackathon

Open Source AI Hackathon

The Fifth Elephant Winter Edition Hackathon

Make a submission

Accepting submissions till 15 Feb 2024, 11:00 PM

Microsoft Reactor Bengaluru, Bengaluru

About the hackathon

The aim of this hackathon is to encourage individuals/teams to apply and develop innovative AI ideas/use cases and publish them as open source projects.

Who can participate

  1. Working professionals
  2. Students
  3. Independent consultants
  4. AI researchers
  5. ML engineers
  6. Lawyers, doctors, agronomists, artists, and others who are keen to collaborate with technologists, and showcase ideas and working demos.

Criteria for submitting projects

  1. Ideas should be open source.
  2. Code should be open source with a permissive open-source LICENSE file added.
  3. Orchestrate your code in such a way that it works with open-source models (pre-trained and fine-tuned), open-source products, platforms, systems, and tools.

How to participate

  1. Submit your project idea and outline here.
  2. Join The Fifth Elephant WhatsApp group to discuss your submission with the mentors.
    Or, if you want to validate your idea/project before submitting it, you can discuss it with the mentors, either in the WhatsApp group or on DM.
  3. Participants should work on their projects and start building soon after submitting ideas. Participants have the entire month of February to work on their projects. The last date for submitting projects is 28 February.
  4. Mentors will be assigned to projects which are shortlisted. Inactive projects, or projects that are not in the consideration list will not be assigned mentors.
  5. Mentors will comment on the submissions during the period - all through Febryart. The reward of the hackathon is the feedback, not just the cash prize.
  6. Demo day for all shortlisted hackathon projects — in person and remote — will be on 10 March. The jury will review the submissions and announce prize winners.

Mentors

  • Bharat Shetty is an AI/ML Consultant. He has worked for Airtel Labs and other organizations on AI/ML/NLP platforms and products, across diverse verticals such as conversational AI, EdTech, IOT, and healthcare. Bharat is the editor of The Fifth Elephant Winter edition, and papers discussion community.

  • Abhishek Mishra is a PSF Fellow and software engineering enthusiast, driving tech events like PyCon India, APAC, Chaos Carnival Conference, and GDG, dedicated to fostering community-centric initiatives.

  • Aniket Maurya is spearheading the creation of intelligent software using AI, serving as a Developer Advocate at Lightning AI ⚡️, and is the creator of GradsFlow.

  • Simrat Hanspal has a career spanning over a decade in the AI ML space, specializing in Natural Language Processing. Currently spearheading AI product strategy at Hasura and has led AI teams at renowned organizations such as VMware, FI Money, and Nirvana Insurance in the past.

  • Sumod Mohan is the co-founder and C.E.O of stealth startup AutoInfer Private limited. He is also technical Advisor and previously CTO of Niqo Robotics where he helped build robots to remove weeds from agricultural farms. This work won the Ministry of Electronics and Information technology (MeitY) and Niti Ayog’s RAISE 2020 Challenge in the Agriculture sector. He was an Advisor to WebCardio, AI based Holter manufacturer (wearable ECG) and led the Computer Vision Division at Soliton Technologies. He was also CTO of Digital Aristotle, which was acquired by Byjus. He has over 15 years of research experience in Computer Vision and over 10 in productizing these technologies in the US and India. Prior to this he worked for HighlightCam Inc, a startup in California where he led Computer Vision Algorithm Development. He holds an M.S degree from Clemson University, USA with a specialization in Intelligent Systems and Robotics.

Editors

  • Bharat Shetty is an AI/ML Consultant. He has worked for Airtel Labs and other organizations on AI/ML/NLP platforms and products, across diverse verticals such as conversational AI, EdTech, IOT, and healthcare. Bharat is the editor of The Fifth Elephant Winter edition, and papers discussion community.
  • Akshobhya Jamadagni is Editorial Assistant for The Fifth Elephant Open Source AI Hackathon. He is passionate about contributing value across various levels of abstraction, from high-level technical strategy to detailed implementation.

Team composition

  1. You can submit your project as an individual.
  2. Team size is restricted to a maximum of 3 members.
  3. Add your teammates as collaborators after submitting your idea.

Ideas for the hackathon

Participants can propose projects around some of the following ideas:

  1. AI for Scientific Research: e.g. Protein folding models, climate models, drug discovery, image recognition for scientific research, simulations for material science, epidemiology, and more.
  2. AI for inclusivity and accessibility: e.g. STT/TTS, automated audio descriptions (for non-voice content), automated color blindness correction, AI-powered sign language generation, real-time AI-powered captioning display for events, educational resources, and content translation across languages by leveraging multi-lingual models, adaptive content for differences in learning ability and/or neurodivergence, etc.
  3. AI and creative expression: e.g., generative audio, video, text, and visuals and ways to combine these in a production-oriented direction, including AR/VR/Gaming and OTT implementations.
  4. AI in education: e.g., personalized learning plans, adaptive learning plans, content creation, translation with context, AI tutors, productivity tools, well-being improvement tools, etc.
  5. AI for India: for e.g., India-specific law, models that focus on indic languages, renewable energy optimization, disaster response and relief, and education accessibility.
  6. Additionally, participants can also pick and work on ideas from the list of ideas submitted in this spreadsheet.

Jury - to be announced

Project Evaluation Criteria

Project Evaluation Criteria Presentation

Prizes

Five prizes of ₹1,00,000 (One lakh rupees) per theme, will be awarded to winners at the hackathon.

About The Fifth Elephant

The Fifth Elephant is a community funded organization. If you like the work that The Fifth Elephant does and want to support meet-ups and activities - online and in-person - contribute by picking up a membership

Contact information

If you have questions about hackathon, post a comment here, or join The Fifth Elephant Telegram group and the WhatsApp group.

Follow @fifthel on Twitter.

For any inquiries, call The Fifth Elephant at +91-7676332020.

Sponsored by Meta

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Supported by

Partner

Microsoft for Startups Founders Hub is a digital ecosystem removing barriers to building a company with free access to technology, coaching, and support for founders in any stage of development. Let us accelerate your startup journey from idea-to-exit. Find out more here: https://startups.microsoft… more

Akshobhya

@akshobhya_j

Bharat Shetty Barkur

@cerebraltangent Editor

Resources for Open-Source AI Hackathon

Submitted Jan 29, 2024

This submission contains a list of knowledge resources shared by the curator of The Fifth Elephant Open-Source AI hackathon, Bharat Shetty Barkur.

You will find interesting links here to spark creativity in your projects and aid in your idea development. If you have any interesting links to share, do add them as comments in this submission.

This submission will be updated daily until the hack day on 3 February. Do check back occasionally for updated resources.

  1. For those who are looking to leverage Microsoft’s semantic kernel along with LLMs: https://towardsdatascience.com/a-pythonistas-intro-to-semantic-kernel-af5a1a39564d

  2. https://github.com/pandeyanuradha/Chatbot-for-mental-health Folks, keep checking out stuff like Kaggle and GitHub for any interesting datasets that can help you in several use-cases like this one. This one uses a Kaggle dataset (less number of data 98 FAQs) But a RAG + LLM approach can also be tried out here etc.

Make sure that your hackathon descriptions detail out the nuances like this. It will help you to break down the main project to actionable small sets of items in the longer run.

  1. I had worked on a course project during my college time on generating molecules for drug discovery. Had referred to this paper then https://arxiv.org/pdf/2001.08184.pdf . Table 3 in the paper mentions some references for datasets. Not sure if your idea is slightly different but maybe the datasets are worth a look

  2. https://cloud.google.com/blog/topics/healthcare-life-sciences/building-a-clinical-intelligence-engine-using-medlm

For those who are looking for datasets in the healthcare mimic dataset an example dataset of open EHR. This Blog can give ideas that can be executed in open manner

  1. Saraswati has shared some datasets that might help for hackathon.
    https://cloud.google.com/healthcare-api/docs/resources/public-datasets
    https://www.kaggle.com/datasets/uciml/breast-cancer-wisconsin-data
    https://data.gov.in/keywords/Diagnostic

  2. https://arxiv.org/pdf/2303.18223.pdf check out this for survey of LLMs

  3. Also, check out this dataset that I found - https://huggingface.co/datasets/knowrohit07/know_medical_dialogues for anyone who wants to finetune and then try some nifty use-cases on top of this.

  4. @Saraswati Chandra has suggested these ideas for the hackathon
    Assuming this is in India context, here are some ideas for health related projects:
    ML
    Currently are predictively good at intent and pattern recognition
    Focus Area - Improving technicians efficiency OR resource deployment (public health)
    Potential Use cases
    a. Nurse hiring and deployment for villages (PH problem)
    b. ASHA workers route map plan
    c. Infectious disease (Malaria) potential harm map
    d. Breast cancer screening
    LLM
    Currently are predictively good at user interaction
    Focus area — Customer Care Journey OR customer Info (Public Health)
    Potential Use Cases

  • Vaccine Information & Reminders
  • Maternal care journey interaction
  • Post-Cancer Care
  • Pre&Post Clinic Visit interaction mgmt
  1. https://github.com/langroid/Awesome-LLM?tab=readme-ov-file#open-llm Folks, since many folks were asking what are the open src llms, take a look here for some of them
    10.https://arvindsaraf.medium.com/technology-for-impact-9b1c2c2c2934

    https://arvindsaraf.medium.com/regulating-ai-1aa732d8f82e
    Between them, one can pull up some ideas.

  2. Quality set of articles and notebooks on LLMs - https://github.com/ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing Please go through these resources - some of them will be helpful for sure.

  3. An example of quick hacks that folks are doing with LLMs and prompt engineering - https://arxiv.org/pdf/2401.14447.pdf

  4. https://github.com/poloclub/wordflow/ check out this to see how nicely they orchestrated the ideas/code/roadmap etc.

  5. https://lilacai-lilac.hf.space/datasets#lilac/OpenHermes-2.5&rowId="0000215f-9b07-46da-a8fa-b23aa28f1ba3" since datasets are useful for LLM pre-training/finetuning open source projects, this is a good example of app that will understand structure of these datasets.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Make a submission

Accepting submissions till 15 Feb 2024, 11:00 PM

Microsoft Reactor Bengaluru, Bengaluru

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Supported by

Partner

Microsoft for Startups Founders Hub is a digital ecosystem removing barriers to building a company with free access to technology, coaching, and support for founders in any stage of development. Let us accelerate your startup journey from idea-to-exit. Find out more here: https://startups.microsoft… more