llm-serving topic
List
llm-serving repositories
runbooks
168
Stars
14
Forks
Watchers
Finetune LLMs on K8s by using Runbooks
lorax
2.1k
Stars
139
Forks
Watchers
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
SwiftInfer
452
Stars
25
Forks
Watchers
Efficient AI Inference & Serving
rtp-llm
519
Stars
48
Forks
Watchers
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
LLM-FineTuning-Large-Language-Models
438
Stars
108
Forks
Watchers
LLM (Large Language Model) FineTuning
ray-educational-materials
339
Stars
63
Forks
Watchers
This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.
ray_vllm_inference
49
Stars
4
Forks
Watchers
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
llms-in-prod-workshop-2023
26
Stars
3
Forks
Watchers
Deploy and Scale LLM-based applications