Open Source AI Hackathon 2024

GenAI makers and creators contest and showcase

Infant Sam Christian

@thelonesamurai

SimGen - the ultimate solution for generating synthetic datasets.

Submitted Nov 18, 2023

Problem Statement

Generating a Synthetic Dataset of Indian Roads: Problems

  • Generate photo-realistic images of urban Indian driving scenes.
  • Simulate complexities like traffic congestion, unexpected road elements, and various weather conditions.
  • Enable real-time interaction with the server to produce photorealistic images.

Solution

  • A simulation-based solution that integrates with Unity, captures the simulation, and processes it into a realistic generated image dataset.
  • Enable real-time configuration and simulation to create large datasets that vary in size and conditions.
  • The tool splits each recorded video into frames and processes “n” frames from each video, sampled at equal intervals.
  • Each sampled frame is processed through a pre-finetuned version of Stable Diffusion and two levels of ControlNets.
  • The final processed images and the intermediary steps are stored in a database.
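
The equal-interval frame sampling described above could look roughly like the sketch below. This is an illustration under assumptions, not the project's actual code: the function name `equal_interval_indices` is hypothetical, and picking the midpoint of each of the "n" equal segments is one possible reading of "equal intervals". Reading the frames themselves (for example with a video library) is left out.

```python
def equal_interval_indices(total_frames: int, n: int) -> list[int]:
    """Return up to n frame indices spread evenly across a video.

    Hypothetical helper: the video is split into n equal segments and
    the middle frame of each segment is selected.
    """
    if total_frames <= 0 or n <= 0:
        return []
    # Never ask for more frames than the video contains.
    n = min(n, total_frames)
    # Integer arithmetic avoids floating-point rounding surprises:
    # midpoint of segment i is (i + 0.5) * total_frames / n.
    return [((2 * i + 1) * total_frames) // (2 * n) for i in range(n)]


if __name__ == "__main__":
    # Example: choose 4 frames from a 100-frame clip.
    print(equal_interval_indices(100, 4))
```

Sampling segment midpoints rather than segment starts avoids always picking the very first frame, which in a recorded simulation is often a loading or fade-in frame; that choice is an assumption and easy to change.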

Comments

  • Akshobhya

    @akshobhya_j Editor & Promoter

    Hello @thelonesamurai!

    The hack day for The Fifth Elephant Open Source AI Hackathon officially kicked off on Saturday, 3rd February! If you haven't started already, now is the perfect moment to dive in and begin building your project.

    Your project needs additional information to be shortlisted in The Fifth Elephant Open Source AI Hackathon.

    Don't miss this opportunity to collaborate, create, and possibly win one of the five prizes of INR 1,00,000 each. Submit your idea, join the conversation with our mentors, and make the most of this unique hackathon experience.

    All the best and happy hacking!

    Posted 1 year ago
  • Akshobhya

    @akshobhya_j Editor & Promoter

    @thelonesamurai, Thank you for your proposal submission to The Fifth Elephant Open Source AI Hackathon. Your project idea of creating SimGen as the ultimate solution for generating synthetic datasets is promising and addresses a critical need in the field of AI and computer vision.

    This submission needs to be updated based on the following considerations.

    Roadmap and Plan of Action

    1. Define clear milestones and deliverables:
      • Outline a detailed roadmap depicting the key phases of development, testing, and refinement.
      • Specify the timeline for each milestone to ensure progress tracking.
    2. Determine which datasets will be utilized and how they will be processed:
      • Provide specifics on the types of Indian urban driving scenes that will be simulated.
      • Detail the process for capturing and processing these scenes to generate the synthetic dataset.
    3. Explain how the solution will integrate with Unity for simulation and image dataset generation:
      • Elaborate on the technical aspects of the integration and the benefits it will offer.

    Dataset Generation Process

    1. Provide detailed insights into the generation of photo-realistic images of Indian driving scenes:
      • Describe the methods and technologies that will be employed to achieve photorealism.
      • Explain how traffic congestion, unexpected road elements, and various weather conditions will be simulated in the dataset.
    2. Articulate the real-time interaction capabilities with the server:
      • Clarify how real-time configuration and simulation will be achieved to produce datasets of varied sizes and conditions.

    Technical Implementation

    1. Elaborate on the video processing and image generation workflow:
      • Provide a comprehensive overview of the video-to-frame conversion process and the subsequent image processing steps.
      • Explain the rationale behind utilizing Stable Diffusion and two levels of ControlNets for image processing.
    2. Detail the database storage and organization of processed images:
      • Clarify the structure of the database where the final images and intermediary steps will be stored.
      • Address data management aspects such as retrieval, indexing, and scalability.

    Feedback and Iteration

    • Consider incorporating mechanisms for automatic evaluation and validation of the generated synthetic datasets.

    • Ensure that the GitHub repository contains a comprehensive README.md documenting the project's purpose, technical details, setup instructions, and example use cases.

    • Provide the GitHub repository link in your proposal for easy access and review by mentors and the jury.

    • Utilize the available platforms such as The Fifth Elephant WhatsApp group to engage with mentors and seek guidance on technical and implementation aspects of your project.

    • The SimGen project has the potential to significantly impact the availability of realistic datasets, thereby benefiting AI and computer vision research and development in the Indian context.

    Best of luck with the continued development of SimGen, and we look forward to seeing the evolution of your project in the hackathon!

    Posted 1 year ago
Hybrid access (members only)

Hosted by

The Fifth Elephant hackathons

Supported by

Venue host

Welcome to the events page for events hosted at The Terrace @ Hasura.
