vllm topic

List vllm repositories

Awesome-LLM-Reasoning

1.6k
Stars
90
Forks
Watchers

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

Awesome-LLM-Inference

2.6k
Stars
175
Forks
Watchers

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

ms-swift

3.6k
Stars
310
Forks
12
Watchers

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...

llm-vscode-inference-server

52
Stars
8
Forks
Watchers

An endpoint server for efficiently serving quantized open-source LLMs for code.

OpenRLHF

2.1k
Stars
206
Forks
Watchers

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

booster

137
Stars
6
Forks
Watchers

Booster - open accelerator for LLM models. Better inference and debugging for AI hackers

super-json-mode

382
Stars
12
Forks
Watchers

Low latency JSON generation using LLMs ⚡️

llama-recipes

14.8k
Stars
2.1k
Forks
193
Watchers

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a...

llm-atc

85
Stars
2
Forks
Watchers

Fine-tuning and serving LLMs on any cloud

TinyLLM

159
Stars
14
Forks
Watchers

Setup and run a local LLM and Chatbot using consumer grade hardware.