The Fifth Elephant

The Fifth Elephant 2024 Annual Conference (12th &13th July)

Maximising the Potential of Data — Discussions around data science, machine learning & AI

Jul 2024

8 Mon

9 Tue

10 Wed

11 Thu

12 Fri

13 Sat 09:00 AM – 06:05 PM IST

14 Sun

Bangalore International Centre, Bangalore

All submissions

Previous Next

LLM's Anywhere: Browser Deployment with Wasm & WebGPU

Submitted Jun 21, 2024

Session type: 30 mins talk

LLM’s Anywhere: Browser Deployment with Wasm & WebGPU

In today’s interconnected world, deploying and accessing machine learning (ML) models efficiently poses major challenges. Traditional methods rely on cloud GPU clusters and constant internet connectivity. However, WebAssembly (Wasm) and WebGPU technologies are revolutionizing this landscape. This talk explores leveraging Wasm and WebGPU to deploy small language models (SLMs) directly within web browsers, eliminating the need for extensive cloud GPU clusters and reducing reliance on constant internet access. We showcase practical examples and discuss how Wasm enables efficient cross-platform ML model execution while WebGPU optimizes parallel computation within browsers. Join us to discover how this fusion empowers developers and users alike with unprecedented ease and efficiency in browser-based ML, while reducing dependence on centralized cloud infrastructure and internet connectivity constraints.

Outline

Introduction
- Overview of traditional ML deployment challenges
- Introduction to WebAssembly (Wasm) and WebGPU
WebAssembly for Cross-Platform ML Execution
- Benefits of Wasm for ML models
- Cross-platform compatibility and performance
WebGPU for Optimized Parallel Computation
- Advantages of WebGPU in browsers
- Practical examples of ML models using WebGPU
Practical Deployment Examples
- Demonstrations of small language models (SLMs) in action
- Case studies of browser-based ML applications
Benefits to the Ecosystem
- Reduced infrastructure costs
- Enhanced accessibility and performance for users
- Simplified deployment processes for developers
- Improved scalability and adaptability
- Enhanced privacy and security through local data processing
Future Directions and Innovations
- Potential developments in Wasm and WebGPU
- Expanding the scope of browser-based ML deployment
Q&A Session
- Addressing audience questions and feedback

Impact

The talk on leveraging WebAssembly and WebGPU for browser-based machine learning deployment offers numerous benefits to the ecosystem. It significantly reduces infrastructure costs by eliminating the need for large GPU clusters in the cloud, making ML accessible to a broader audience. Users experience enhanced accessibility and improved performance with seamless access to ML inference capabilities directly in their browsers. Developers benefit from simplified deployment processes and increased productivity, while scalability ensures adaptable solutions for varying workloads. Furthermore, deploying ML models within the browser enhances privacy and security by keeping data processing local, while WebAssembly’s cross-platform compatibility ensures accessibility across different devices and operating systems. By fostering innovation and empowering emerging markets with limited access to high-performance computing resources, this approach contributes to a more inclusive and sustainable ecosystem for browser-based machine learning deployment.

All submissions

Previous Next

Comments

Jul 2024

8 Mon

9 Tue

10 Wed

11 Thu

12 Fri

13 Sat 09:00 AM – 06:05 PM IST

14 Sun

Hosted by

The Fifth Elephant

Jumpstart better data engineering and AI futures

Supported by

Gold Sponsor

Atlassian

Atlassian unleashes the potential of every team. Our agile & DevOps, IT service management and work management software helps teams organize, discuss, and compl

Silver Sponsor

Google

Together, we can build for everyone.

Workshop sponsor

Datastax

Datastax, the real-time AI Company.

Lanyard Sponsor

Uber

We reimagine the way the world moves for the better.

Sponsor

Monster API

MonsterAPI is an easy and cost-effective GenAI computing platform designed for developers to quickly fine-tune, evaluate and deploy LLMs for businesses.

Community Partner

FOSS United Foundation

FOSS United is a non-profit foundation that aims at promoting and strengthening the Free and Open Source Software (FOSS) ecosystem in India. more

Beverage Partner

BONOMI

BONOMI is a ready to drink beverage brand based out of Bangalore. Our first segment into the beverage category is ready to drink cold brew coffee.