Mattia Bradascio
Mattia Bradascio
Disabling containerd does it on OSX (Docker Desktop).
For this I'd recommend setting up local storage that is S3 compatible like Minio 👍🏼
I have another issue where the default runai streamer simply terminates my vLLM container after a certain (relatively short) time-period: ``` (VllmWorker rank=0 pid=783) Loading safetensors using Runai Model Streamer:...
@nijave wow that is quite hectic. This model is much larger 235B. I have my object store on NVMe in the same network, so its technically already on disk, I...
@nijave in my case this was actually just a healthcheck that failed - and it turned out to work perfectly without using sharding by simply setting the health and liveness...