aibrix
aibrix copied to clipboard
Automate local disk management and ai runtime model management
🚀 Feature Description and Motivation
In DeepSeek-R1 examples, we notice setting up ReadWriteMany volume is kind of painful for cloud agnostic environments. S3FS based solutions are not very efficient. Most efficiency way is to fully leverage bandwidth to download models from remote object store (0->1) and then load it from disk to GPU memory(cover restarts, redeploy case). However, even most high end machine provides local disk, they are not efficiently managed. We need some local disk management solutions along with runtime cache capabilities (like huggingface cache folder) to streamline this process.
Use Case
efficiently load model weights from object storage and manage in local disks.
Proposed Solution
No response