llm-serving topic

List llm-serving repositories

runbooks

168 stars, 14 forks

Fine-tune LLMs on Kubernetes using Runbooks

lorax

2.1k stars, 139 forks

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

SwiftInfer

452 stars, 25 forks

Efficient AI Inference & Serving

rtp-llm

519 stars, 48 forks

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

ray-educational-materials

339 stars, 63 forks

A suite of hands-on training materials that show how to scale CV, NLP, and time-series forecasting workloads with Ray.

ray_vllm_inference

49 stars, 4 forks

A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.

llms-in-prod-workshop-2023

26 stars, 3 forks

Deploy and Scale LLM-based applications

aici

1.9k stars, 78 forks

AICI: Prompts as (Wasm) Programs