llm-serving topic

List llm-serving repositories

BentoML

8.3k
Stars
891
Forks
8.3k
Watchers

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

get-beam

96
Stars
24
Forks
Watchers

Run GPU inference and training jobs on serverless infrastructure that scales with you.

helix

547
Stars
59
Forks
547
Watchers

♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.

llm-action

9.3k
Stars
912
Forks
Watchers

This project aims to share the technical principles behind large models along with hands-on practical experience.

BurstGPT

112
Stars
5
Forks
Watchers

A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

happy_vllm

22
Stars
1
Forks
Watchers

A REST API for vLLM, production ready

Awesome_LLM_System-PaperList

156
Stars
6
Forks
Watchers

Since the emergence of ChatGPT in 2022, accelerating Large Language Models has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on inferenc...

llm-inference-solutions

67
Stars
3
Forks
Watchers

A collection of all available inference solutions for LLMs

Awesome-LLM-Productization

20
Stars
4
Forks
Watchers

Awesome-LLM-Productization: a curated list of tools/tricks/news/regulations about AI and Large Language Model (LLM) productization

pratical-llms

38
Stars
9
Forks
Watchers

A collection of hands-on notebooks for LLM practitioners