fastertransformer topic

List fastertransformer repositories

serving-codegen-gptj-triton

20
Stars
0
Forks
Watchers

Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes

lmdeploy

7.3k
Stars
625
Forks
7.3k
Watchers

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.