musoles
musoles
### Your current environment The output of `python env.py` Collecting environment information... PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch:...
### Your current environment The output of `python env.py` Collecting environment information... PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch:...
### 🚀 Feature Description and Motivation I can see in the documentation information for how to use autoscaling and how to do multi-node deployments, but not on both at the...
### ❓ Is your enhancement related to a problem? Currently it's possible to set a static number of replicas per model, but this is often leads to over-provisioning (underutilised replicas)...