vllm topic

List vllm repositories

Awesome-LLM-Reasoning

1.2k
Stars
60
Forks
Watchers

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

Awesome-LLM-Inference

1.5k
Stars
118
Forks
Watchers

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

swift

1.7k
Stars
168
Forks
12
Watchers

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 25+ MLLMs

llm-vscode-inference-server

44
Stars
6
Forks
Watchers

An endpoint server for efficiently serving quantized open-source LLMs for code.

OpenRLHF

1.3k
Stars
123
Forks
Watchers

An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)

booster

124
Stars
4
Forks
Watchers

Booster - open platform for serving LLM models

super-json-mode

347
Stars
11
Forks
Watchers

Low latency JSON generation using LLMs ⚡️

llama-recipes

9.9k
Stars
1.4k
Forks
81
Watchers

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a...

llm-atc

82
Stars
2
Forks
Watchers

Fine-tuning and serving LLMs on any cloud

TinyLLM

102
Stars
9
Forks
Watchers

Setup and run a local LLM and Chatbot using consumer grade hardware.