Thomas Jack Carroll comments

Results: 14 comments of Thomas Jack Carroll

Support per user api-key for multi-tenant use case

One use-case I'd love to see supported as a tenant-aware optimization is tenant-based LoRA adapters.

Examples should come with health and readiness checks

The [Quickstart Model Sample](https://github.com/vllm-project/aibrix/blob/main/samples/quickstart/model.yaml) already includes checks, but they are too tight for the current model download. 120 seconds is not enough. Going to log an issue and will link...
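As a rough illustration of loosening the checks, a startup probe can give the container a longer budget before the readiness check takes over. This is only a sketch: the `/health` path, port `8000`, and every timing value are assumptions, not values taken from the sample.

```yaml
# Sketch only: fragment of a container spec; all values are illustrative.
startupProbe:
  httpGet:
    path: /health          # assumed health endpoint of the model server
    port: 8000             # assumed container port
  periodSeconds: 10
  failureThreshold: 60     # 60 x 10s = up to 600s for model download and load
readinessProbe:
  httpGet:
    path: /health          # assumed, same endpoint as above
    port: 8000
  periodSeconds: 5
  failureThreshold: 3      # only evaluated after the startup probe succeeds
```

The startup budget is `periodSeconds` multiplied by `failureThreshold`, so the allowance can be sized to the expected download time without slowing down failure detection once the pod is running.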

Examples should come with health and readiness checks

See #772

Replace our cloned 3rd-party yamls with helm charts

I'm willing to take this up

Replace our cloned 3rd-party yamls with helm charts

# Research on Potential Kustomize Version Issues

Kustomize added support for helm charts in [v4.1.0](https://github.com/kubernetes-sigs/kustomize/releases/tag/kustomize%2Fv4.1.0). My current kubectl client (found with `kubectl version`) is built with kustomize v5.5.0. Our CI...
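For context, a `kustomization.yaml` that pulls in a chart through the `helmCharts` field might look like the sketch below. The chart name, repository URL, version, and namespace are hypothetical placeholders, not the project's actual dependencies.

```yaml
# Sketch only: chart name, repo, version, and namespace are hypothetical.
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization

helmCharts:
  - name: example-chart               # hypothetical chart name
    repo: https://charts.example.com  # hypothetical chart repository
    version: 1.0.0                    # hypothetical chart version
    releaseName: example
    namespace: example-system
    valuesInline:
      replicaCount: 1                 # hypothetical chart value
```

Rendering this requires opting in to Helm support, for example with `kustomize build --enable-helm .`, and a `helm` binary available on the machine running the build.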

Replace our cloned 3rd-party yamls with helm charts

## Another Potential Issue Regarding Usage

The [docs](https://github.com/kubernetes-sigs/kustomize/blob/master/examples/chart.md#but-its-not-really-about-performance) for Kustomize state:

> Although the helm related fields discussed above are handy for experimentation and development, it's best to avoid them...

[CI] Generate helm package from kubebuilder manifests

Support high availability of gateway server for production users

Applying the charts on gke has an error

Applying the charts on gke has an error