text-generation-inference topic
List
text-generation-inference repositories
optimum-benchmark
240
Stars
43
Forks
Watchers
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
harbor
396
Stars
19
Forks
Watchers
Effortlessly run LLM backends, APIs, frontends, and services with one command.
llmaz
20
Stars
10
Forks
Watchers
☸️ Easy, advanced inference platform for large language models on Kubernetes