aibrix
aibrix copied to clipboard
Provide production grade overlay manifests
🚀 Feature Description and Motivation
We encounter some stability issues in the past, https://github.com/vllm-project/aibrix/pull/825 was proposed to address such issues. We should use this opportunity to further check current component setups including resources, QoS, health check, availability settings (replicas) etc.
Beside the code and manifest change, we should also provide comprehensive docs telling user how to tune these configuration for their environments. User can fully customize such parameters for their own environments.
Use Case
for stable production grade setup
Proposed Solution
No response