DeepSpeedExamples
Can I train an OPT-6.7B model on 4x 4090 GPUs?
I have four 4090 GPUs and want to train OPT-6.7B with DeepSpeed Chat. Is that possible? I'm not sure whether I need a single GPU with enough VRAM to hold the model, or whether four smaller GPUs can be used together instead.
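For context, here is a rough back-of-the-envelope sketch of the memory arithmetic behind the question. It assumes mixed-precision Adam training at about 16 bytes per parameter (fp16 weights and gradients plus fp32 master weights, momentum, and variance), a parameter count of exactly 6.7e9, 24 GB of VRAM per 4090, and model states split evenly across the GPUs as with ZeRO stage 3. Activations, buffers, and the extra models used in the RLHF step are ignored, so real usage would be higher. The function names are made up for illustration and are not part of DeepSpeed.

```python
# Rough estimate of model-state memory for mixed-precision Adam training.
# Assumption: 16 bytes per parameter
#   2 (fp16 weights) + 2 (fp16 gradients)
#   + 12 (fp32 master weights, Adam momentum, Adam variance).
# Activations and temporary buffers are NOT counted.

BYTES_PER_PARAM = 2 + 2 + 12


def model_state_gb(num_params: float) -> float:
    """Total memory for weights, gradients, and optimizer states, in GB (1e9 bytes)."""
    return num_params * BYTES_PER_PARAM / 1e9


def per_gpu_gb(num_params: float, num_gpus: int) -> float:
    """Per-GPU share if model states are partitioned evenly (ZeRO-3 style)."""
    return model_state_gb(num_params) / num_gpus


NUM_PARAMS = 6.7e9   # assumed parameter count for OPT-6.7B
NUM_GPUS = 4
GPU_VRAM_GB = 24     # VRAM of one 4090

total = model_state_gb(NUM_PARAMS)
share = per_gpu_gb(NUM_PARAMS, NUM_GPUS)

print(f"total model states: {total:.1f} GB")
print(f"per-GPU share:      {share:.1f} GB (available: {GPU_VRAM_GB} GB)")
print("fits without offload:", share <= GPU_VRAM_GB)
```

Under these assumptions the model states alone come to roughly 107 GB, or about 26.8 GB per GPU when split four ways, which is slightly more than one 4090 provides. So full fine-tuning would probably need something extra, such as offloading optimizer states to CPU memory or training only LoRA adapters, but that is an estimate rather than a measured result.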