Open source AI Hackathon

Open source AI Hackathon

The Fifth Elephant Winter Edition hackathon

About the hackathon

Whether you are an amateur, an AI enthusiast, or a veteran ML engineer, we invite you to participate in The Fifth Elephant Open Source AI hackathon. We will have experienced mentors giving you guidance on the projects that you are working on. And did we say cash prizes!?!?!

You can submit your project idea and outline here.

Things to keep in mind

  1. Only open-source models and projects will be valid. So please use LLaMa 2 model by Meta or other open-source models only.
  2. Jury members will need a link to a demo, and your GitHub repo for the project.
  3. A public video link to the demo will also be required. You can use this for the in-person presentation to the jury members as well.

Themes

Participants can propose projects which cover the spectrum of GenAI. Following are some of the themes you can work on, and which the jury will consider:

  1. AI for Scientific Research: e.g. Protein folding models, climate models, drug discovery, image recognition for scientific research, simulations for material science, epidemiology, and more.
  2. AI for inclusivity and accessibility: e.g. STT/TTS, automated audio descriptions (for non-voice content), automated color blindness correction, AI powered sign language generation, real-time AI powered captioning display for events, educational resources, and content translation across languages by leveraging multi-lingual models, adaptive content for differences in learning ability and/or neurodivergence, etc.
  3. AI and creative expression: e.g., generative audio, video, text and visuals and ways to combine these in a production-oriented direction, including AR/VR/Gaming and OTT implementations.
  4. AI in education: e.g., personalized learning plans, adaptive learning plans, content creation, translation with context, AI tutors, productivity tools, well-being improvement tools, etc.
  5. AI for India: for e.g., India-specific law, models that focus on indic languages, renewable energy optimization, disaster response and relief, education accessibility.

Curators of the hackathon

Divya Tak: co-founder at Joyus, a multi-disciplinary creative. Divya also runs the AI for creatives community.

Bharat Shetty is an AI/ML Consultant. He has worked for Airtel Labs and other organizations on AI/ML/NLP platforms and products, across diverse verticals such as conversational AI, EdTech, IOT, and healthcare. Bharat is the editor of The Fifth Elephant Winter edition, and papers discussion community.

Mentors - to be announced

Jury - to be announced

Submission guidelines for projects

All submissions for the projects must be made here. Your ideas and projects are iterative. Use this opportunity to discuss your ideas with the curators and mentors, and improve on them as the hackathon date approaches.

Submissions are also helpful to find collaborators for your projects. Be open and forthcoming in sharing your ideas.

Prizes

Five prizes of Rs. 1,00,000 (one lakh rupees) each, will be given to winners at the hackathon. One prize is allocated for each theme.

Contact information

If you have questions about the format of the hackathon, post a comment here.

Join The Fifth Elephant Telegram group or the WhatsApp group.

Follow @fifthel on Twitter.

For any inquiries, call The Fifth Elephant on +91-7676332020.

Sponsored by Meta

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Supported by

Suhas

Speech to text for Indian Healthcare

Submitted Nov 20, 2023

Speech to text for Indian Healthcare

If you had access to a speech-to-text model which is specifically trained on English audio taken from Indian speakers, it would be more accurate than the audio models trained on general audio data. And if we extend this to the medical domain, then the accuracy of such models would be significantly higher than the SOTA models like whisper which are not trained specifically on Indian context.

In this hackathon I would like to work on this project to:

  1. Explore current speech-to-text models in Indian context
  2. Explore pulbic data-sets with audio data of Indian speakers available in english and Healthcare field
  3. Fine-tune SOTA models like whisper and distil-whisper and compare results

Currently a lot of applications are being built for health-care professionals to help digitize patient healthrecords, clinical recordings etc. With models trained for such specific domain, it will be easier for others to build applications over them.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Supported by