text-generation-inference topic

List text-generation-inference repositories

optimum-benchmark

240
Stars
43
Forks
Watchers

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

harbor

396
Stars
19
Forks
Watchers

Effortlessly run LLM backends, APIs, frontends, and services with one command.

llmaz

20
Stars
10
Forks
Watchers

☸️ Easy, advanced inference platform for large language models on Kubernetes