Leverage Kubernetes OCI image as volume feature
Is your feature request related to a problem? Please describe.
Pulling model weights adds latency, and we would like to reduce it.
Describe the solution you'd like
Leverage the ImageVolume feature in Kubernetes to allow model weights to be mounted to a pod as an OCI image.
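As a rough illustration of the idea, the sketch below builds a Pod manifest that mounts model weights through an `image` volume source. The image references, the volume name, the mount path, and the helper name `build_image_volume_pod` are all hypothetical placeholders, not anything KAITO ships today. It also assumes a cluster where the `ImageVolume` feature gate is enabled and the container runtime supports it.

```python
import json


def build_image_volume_pod(name, runtime_image, weights_image, mount_path="/models"):
    """Return a Pod manifest (as a dict) that mounts an OCI image as a read-only volume.

    All arguments are illustrative; nothing here reflects an existing KAITO preset.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "containers": [
                {
                    "name": "inference",
                    "image": runtime_image,
                    "volumeMounts": [
                        {
                            "name": "model-weights",
                            "mountPath": mount_path,
                            # Image volumes are mounted read-only.
                            "readOnly": True,
                        }
                    ],
                }
            ],
            "volumes": [
                {
                    "name": "model-weights",
                    # The `image` volume source points at an OCI artifact
                    # that holds the weights instead of a runnable image.
                    "image": {
                        "reference": weights_image,
                        "pullPolicy": "IfNotPresent",
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical image names, for illustration only.
    pod = build_image_volume_pod(
        name="phi-4-image-volume-test",
        runtime_image="example.com/inference-runtime:latest",
        weights_image="example.com/models/phi-4:latest",
    )
    # kubectl accepts JSON as well as YAML, e.g. `python pod.py | kubectl apply -f -`.
    print(json.dumps(pod, indent=2))
```

With this layout the runtime image and the weights can be pulled and cached independently, so a change to one does not force a re-pull of the other.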
Describe alternatives you've considered
Additional context
Let's run some tests to see how much time we can save by using an OCI volume. @nojnhuh, the phi4 model is suitable for this test. Use this script to generate a test manifest: https://github.com/kaito-project/kaito/blob/main/presets/workspace/test/scripts/README.md
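One possible way to compare the two setups is to measure how long each pod takes to become Ready. The sketch below is only an assumption about how the measurement could be done, not an agreed test plan. It reads a pod object (for example the output of `kubectl get pod <name> -o json`) and returns the seconds between `metadata.creationTimestamp` and the `Ready` condition's `lastTransitionTime`. The function name `seconds_until_ready` is hypothetical.

```python
from datetime import datetime, timezone


def _parse_k8s_time(ts):
    """Parse a Kubernetes RFC 3339 timestamp such as 2025-01-01T00:00:00Z."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def seconds_until_ready(pod):
    """Return seconds from pod creation to Ready, or None if the pod is not Ready."""
    created = _parse_k8s_time(pod["metadata"]["creationTimestamp"])
    for cond in pod.get("status", {}).get("conditions", []):
        if cond.get("type") == "Ready" and cond.get("status") == "True":
            ready_at = _parse_k8s_time(cond["lastTransitionTime"])
            return (ready_at - created).total_seconds()
    return None


if __name__ == "__main__":
    # Minimal hand-written pod object, for illustration only.
    sample = {
        "metadata": {"creationTimestamp": "2025-01-01T00:00:00Z"},
        "status": {
            "conditions": [
                {
                    "type": "Ready",
                    "status": "True",
                    "lastTransitionTime": "2025-01-01T00:01:30Z",
                }
            ]
        },
    }
    print(seconds_until_ready(sample))
```

This interval also includes scheduling, container start, and model loading, so it is only meaningful when both runs use the same node type and start from a cold image cache.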
@zhuangqh moving this item to In Progress
@zhuangqh can you provide an update here?