GLM-130B
GLM-130B copied to clipboard
Does it support deep speed ZeRO to offload parameters to CPU and NVMe ssd?
I had used ChatGLM-6B. I could use deep speed tech to offload the parameters to CPU and NVMe ssd. So I could finetune the model on a machine with only one T4 16G card. Dose ChatGLM-130B support deep speed too? Does it support ZeRO stage 3 to offload parameters to CPU memory and NVMe ssd?