kaito icon indicating copy to clipboard operation
kaito copied to clipboard

Leverage Kubernetes OCI image as volume feature

Open ritazh opened this issue 7 months ago • 3 comments

Is your feature request related to a problem? Please describe.

Reduce latency for pulling model weights

Describe the solution you'd like

Leverage the ImageVolume feature in Kubernetes to allow model weights to be mounted to a pod as an OCI image.

Describe alternatives you've considered

Additional context

ritazh avatar Apr 28 '25 22:04 ritazh

let us do some test to see how much time we can benefit from using oci-volume. @nojnhuh phi4 model is suitable for this test. Use this script to generate a testing manifest. https://github.com/kaito-project/kaito/blob/main/presets/workspace/test/scripts/README.md

zhuangqh avatar May 05 '25 13:05 zhuangqh

@zhuangqh moving this item to In Progress

sdesai345 avatar Jun 24 '25 14:06 sdesai345

@zhuangqh can you provide an update here?

sdesai345 avatar Aug 05 '25 14:08 sdesai345