Submissions
Akshat Gupta

Akshat Gupta

@akshatg

Experienced Deep Learning practitioner specializing in Computer Vision and Generative AI, leveraging TensorFlow, Pytorch, and advanced ML techniques to develop

  • Joined Jun 2023

The Fifth Elephant 2023 Monsoon

Harmonising Art and AI: Crafting Jazzy and Juicy Video Snippets through AI

Abstract In recent times, Live Streaming platforms are gaining popularity where live content is being shown to users. Typically, the videos created by the creators range from 15 minutes to an hour. After intensive research, it was found that a sizable chunk of users drops within first 30 seconds of the video. Another piece of research shows that, on average, a user only has an attention span of 3… more
  • 4 comments
  • Submitted
  • 30 Jun 2023

GenerativeAI July Meetup

Harmonising Art and AI: Crafting Jazzy and Juicy Video Snippets through AI

Abstract In recent times, Live Streaming platforms are gaining popularity where live content is being shown to users. Typically, the videos created by the creators range from 15 minutes to an hour. After intensive research, it was found that a sizable chunk of users drops within first 30 seconds of the video. Another piece of research shows that, on average, a user only has an attention span of 3… more
  • 0 comments
  • Submitted
  • 20 Jul 2023

The Fifth Elephant 2023 Winter

Automated Genre Based Music Morphing using AI

Abstract In the realm of audio processing and music production, automated genre-based audio morphing is an emerging field that merges the creative boundaries of different music genres through the power of machine learning and natural language processing (NLP). This innovative approach leverages textual prompts to drive the transformation of audio content, transcending traditional genre constraint… more
  • 0 comments
  • Confirmed & scheduled
  • 30 Sep 2023

Open Source AI Hackathon 2024

Musickiya

Problem Statement As GenAI is on its path to revolutionise the way most things are done, we propose an innovative application of it. For the last many years, we have witnessed AI assistants that mostly assist with specific day to day activities like writing, setting up an alarm, etc, and mode of communication if typically either via chat or voice commands. We have designed a Music Assistant calle… more
  • 4 comments
  • Confirmed & scheduled
  • 24 Jan 2024
Category: AI for image generation/creatives

Advancing multimodal and agentic AI: systems, storage & scalability

The Emergence of Multi-Agent Systems in Computer Vision: A New Era of Creative Collaboration

The core advantage of multi-agent systems in computer vision lies in their ability to divide complex tasks into smaller, manageable sub-tasks, allowing for more efficient and scalable processing. This paradigm fosters collaboration, where agents can exchange information and update each other’s knowledge, resulting in a more refined outcome, keeping every agrnt in sync. more
  • 1 comment
  • Confirmed & scheduled
  • 25 Mar 2025
Type of session: Sponsored workshop I am submitting for: Blr OSAI meetup in April 2025