fastertransformer topic
fastertransformer repositories
serving-codegen-gptj-triton
20 Stars, 0 Forks
Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes
lmdeploy
4.5k Stars, 404 Forks
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.