Automate local disk management and ai runtime model management

Open Jeffwan opened this issue 9 months ago • 0 comments

🚀 Feature Description and Motivation

In DeepSeek-R1 examples, we notice setting up ReadWriteMany volume is kind of painful for cloud agnostic environments. S3FS based solutions are not very efficient. Most efficiency way is to fully leverage bandwidth to download models from remote object store (0->1) and then load it from disk to GPU memory(cover restarts, redeploy case). However, even most high end machine provides local disk, they are not efficiently managed. We need some local disk management solutions along with runtime cache capabilities (like huggingface cache folder) to streamline this process.

Use Case

efficiently load model weights from object storage and manage in local disks.

Proposed Solution

No response

Mar 12 '25 14:03 Jeffwan