vllm topic

List vllm repositories

Awesome-LLM-Reasoning

3.5k
Stars
201
Forks
3.5k
Watchers

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

Awesome-LLM-Inference

4.9k
Stars
330
Forks
4.9k
Watchers

📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉

ms-swift

11.9k
Stars
1.1k
Forks
11.9k
Watchers

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi...

llm-vscode-inference-server

52
Stars
8
Forks

An endpoint server for efficiently serving quantized open-source LLMs for code.

OpenRLHF

8.7k
Stars
841
Forks
8.7k
Watchers

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

booster

137
Stars
6
Forks

Booster: an open accelerator for LLMs. Better inference and debugging for AI hackers.

super-json-mode

382
Stars
12
Forks

Low latency JSON generation using LLMs ⚡️

llama-cookbook

17.0k
Stars
2.4k
Forks

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama model f...

llm-atc

85
Stars
2
Forks

Fine-tuning and serving LLMs on any cloud

TinyLLM

159
Stars
14
Forks

Set up and run a local LLM and chatbot using consumer-grade hardware.