Open Source AI Hackathon 2024

GenAI makers and creators contest and showcase

Membership

The Fifth Elephant annual membership

The Fifth Elephant membership is valid for one year (12 months). Members get the following benefits:

  • Participation in all online peer review sessions.
  • Access to all recordings from online reviews.
  • Priority access to all offline meet-ups and online workshops hosted by The Fifth Elephant during the one-year period.
  • Access to The Fifth Elephant’s Annual Conference on 18 and 19 July 2025 in Bangalore - in-person and virtually (via live stream).

Corporate Members-only benefits (bulk ticket purchase):

  • Transfer of memberships across individuals in the organization.

Memberships can be cancelled within 1 hour of purchase.

₹5100

Sale at this price closes on December 31, 2025

Cancellation and refund policy

Memberships can be cancelled within 1 hour of purchase.

Workshop tickets can be cancelled or transferred up to 24 hours prior to the workshop.

For further queries, please write to us at support@hasgeek.com or call us at +91 7676 33 2020.

Suhas

Speech to text for Indian Healthcare

Submitted Nov 20, 2023

A speech-to-text model trained specifically on English audio from Indian speakers would likely be more accurate on Indian-accented speech than models trained on general audio data. Extended to the medical domain, such a model could be significantly more accurate on Indian clinical speech than SOTA models like Whisper, which are not trained specifically for the Indian context.

In this hackathon I would like to work on this project to:

  1. Explore current speech-to-text models in the Indian context
  2. Explore public datasets with English audio from Indian speakers in the healthcare field
  3. Fine-tune SOTA models like Whisper and Distil-Whisper and compare results
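
Comparing results in step 3 needs a common metric. The proposal does not name one, so the following is only a minimal sketch that assumes word error rate (WER): the word-level edit distance between a reference transcript and a model's output, divided by the number of reference words. The function names, the normalization rule, and the sample transcripts are illustrative assumptions, not output from any real model.

```python
import re


def normalize(text: str) -> list[str]:
    """Lowercase, strip punctuation, and split into words (an assumed normalization)."""
    return re.sub(r"[^\w\s]", "", text.lower()).split()


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref, hyp = normalize(reference), normalize(hypothesis)
    if not ref:
        raise ValueError("reference transcript is empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (standard Levenshtein dynamic programming).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical transcripts, written by hand for illustration only.
reference = "Patient reports chest pain since two days."
outputs = {
    "baseline (hypothetical)": "patient report chess pain since two days",
    "fine-tuned (hypothetical)": "patient reports chest pain since two days",
}
for name, transcript in outputs.items():
    print(f"{name}: WER = {word_error_rate(reference, transcript):.3f}")
```

Averaging this score over a held-out set of Indian-English medical recordings, once for each model before and after fine-tuning, would give a like-for-like comparison. An established package could replace the hand-written function in practice.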

Currently, many applications are being built to help healthcare professionals digitize patient health records, clinical recordings, etc. With models trained for this specific domain, it will be easier for others to build applications on top of them.

Comments

  • Akshobhya

    @akshobhya_j Editor & Promoter

    Hello Suhas!

    The hack day for The Fifth Elephant Open Source AI Hackathon officially kicked off on Saturday, 3rd February! If you haven't started already, now is the perfect moment to dive in and begin building your project.

    Your project needs additional information to be shortlisted in The Fifth Elephant Open Source AI Hackathon.

    Don't miss this opportunity to collaborate, create, and possibly win one of the five prizes of INR 1,00,000 each. Submit your idea, join the conversation with our mentors, and make the most of this unique hackathon experience.

    All the best and happy hacking!

    Posted 1 year ago
  • Akshobhya

    @akshobhya_j Editor & Promoter

    Suhas, thank you for your proposal submission to The Fifth Elephant Open Source AI Hackathon. The focus on leveraging speech-to-text technology for the Indian healthcare context is commendable and holds great potential for improving accuracy and applicability in this domain.

    This submission needs to be updated based on the following considerations.

    Speech-to-Text Model Enhancement

    1. Advantages of Indian English Audio Training:
      • Highlight the benefits of using speech-to-text models specifically trained on English audio from Indian speakers.
      • Emphasize the potential accuracy improvements compared to models trained on general audio data.

    Proposed Project Goals

    1. Investigating Speech-to-Text Models in the Indian Context:
      • Outline the intention to explore existing speech-to-text models tailored for the Indian context, showcasing a clear understanding of the project's starting point.
    2. Exploration of Public Datasets for Indian English Audio in Healthcare:
      • Clearly define the plan to explore publicly available datasets containing audio data from Indian speakers within the healthcare domain, demonstrating a data-driven approach.
    3. Comparative Analysis of SOTA Models:
      • Discuss the strategy to fine-tune state-of-the-art (SOTA) models like Whisper and Distil-Whisper with the aim of comparing results, showcasing a commitment to rigorous evaluation and improvement.

    Impact on Healthcare Applications

    • Underline the significance of developing domain-specific speech-to-text models for healthcare professionals and the potential for facilitating the digitization of patient health records and clinical recordings.

    Project Evaluation and Implementation

    1. Address the Need for Domain-Specific Models:
      • Recognize the increasing demand for applications tailored to healthcare professionals and underscore the potential ease of application development when using domain-specific models.
    2. Aligning Project Goals with Healthcare Industry Needs:
      • Emphasize the potential benefits of leveraging models trained for the specific healthcare domain in order to encourage wide-ranging applications and solutions.

    Feedback and Iteration

    • Provide the GitHub repository link in your proposal for easy access and review by mentors and the jury.

    • Ensure that the GitHub repository contains a comprehensive README.md documenting the project's purpose, technical details, setup instructions, and example use cases.

    • Utilize the available platforms such as The Fifth Elephant WhatsApp group to engage with mentors and seek guidance on technical and implementation aspects of your project.

    We encourage you to pursue this project, which holds significant value for the healthcare industry, and we look forward to its progress and outcomes.

    Posted 1 year ago
Hybrid access (members only)

Hosted by

The Fifth Elephant hackathons

Supported by

Host

Jump starting better data engineering and AI futures

Venue host

Welcome to the events page for events hosted at The Terrace @ Hasura.

Partner

Providing all founders, at any stage, with free resources to build a successful startup.