kubernetes-engine-samples
kubernetes-engine-samples copied to clipboard
Code for AI on GKE guide series
- Kustomize patches to run various quantized models in vLLM and TGI runtimes.