fastertransformer topic
List
fastertransformer repositories
serving-codegen-gptj-triton
20
Stars
0
Forks
Watchers
Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes
lmdeploy
7.3k
Stars
625
Forks
7.3k
Watchers
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.