LLM-VM
irresponsible innovation. Try now at https://chat.dev/
Definition of done: implement training of large models with FSDP (Fully Sharded Data Parallel) to accelerate training on large datasets. Reference: https://pytorch.org/blog/introducing-pytorch-fully-sharded-data-parallel-api/
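As a starting point, a minimal FSDP training sketch might look like the following. This is an assumption-laden illustration, not LLM-VM code: the model is a stand-in `nn.Sequential` rather than a real LLM, the objective is a dummy loss, and it assumes PyTorch >= 2.0 launched with `torchrun --nproc_per_node=<num_gpus>` so that one process drives each GPU.

```python
import os
import torch
import torch.nn as nn

def build_model(hidden: int = 128) -> nn.Module:
    # Stand-in for a large model; real use would load the actual LLM here.
    return nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(), nn.Linear(hidden, hidden))

def train_fsdp(steps: int = 10) -> None:
    import torch.distributed as dist
    from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

    dist.init_process_group(backend="nccl")  # one process per GPU, launched via torchrun
    rank = dist.get_rank()
    torch.cuda.set_device(rank)

    # FSDP shards parameters, gradients, and optimizer state across ranks,
    # so each GPU only holds a fraction of the full model at rest.
    model = FSDP(build_model().cuda())
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)

    for _ in range(steps):
        x = torch.randn(32, 128, device="cuda")
        loss = model(x).pow(2).mean()  # dummy objective for illustration
        loss.backward()
        opt.step()
        opt.zero_grad()

    dist.destroy_process_group()

if __name__ == "__main__" and "RANK" in os.environ:  # only run under torchrun
    train_fsdp()
```

Real integration would also need an `auto_wrap_policy` so individual transformer blocks are sharded, plus checkpointing of the sharded state dict.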
Could possibly be done using `Scalene`
Enable inference for large models that can't fit on the GPU by paging parameters back and forth between RAM and GPU RAM.
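A minimal sketch of the idea: keep all weights in CPU RAM and stream each layer to the GPU only for the moment it executes, freeing GPU RAM before the next layer moves in. This is an illustration, not the LLM-VM implementation; the `nn.Linear` layers stand in for transformer blocks, and the code falls back to CPU-only when no GPU is present (the swap then becomes a no-op).

```python
import torch
import torch.nn as nn

@torch.no_grad()
def offloaded_forward(layers, x, device):
    for layer in layers:
        layer.to(device)       # copy this layer's params RAM -> GPU-RAM
        x = layer(x.to(device))
        layer.to("cpu")        # free GPU-RAM before the next layer moves in
    return x

layers = nn.ModuleList([nn.Linear(64, 64) for _ in range(8)])  # stand-in blocks
device = "cuda" if torch.cuda.is_available() else "cpu"
out = offloaded_forward(layers, torch.randn(2, 64), device)
print(out.shape)  # prints torch.Size([2, 64])
```

The trade-off is that every forward pass pays PCIe transfer cost for the full model; Hugging Face Accelerate's `device_map="auto"` offloading implements a more sophisticated version of this pattern and may be worth evaluating instead of a from-scratch scheduler.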
Currently LLM-VM does not support multi-GPU setups. Using RunPod, I rented a setup with 2 RTX 3090 GPUs. While running the local Bloom model example from the [docs](https://anarchy.ai/get_started/quickstart/completions), I...
We discussed YouTube videos in our latest triage, but before we publish any videos it would be good to have the video content written up as blog posts.
Open bounty for demo applications to add to a curated example gallery.