DeepSpeedExamples icon indicating copy to clipboard operation
DeepSpeedExamples copied to clipboard

RuntimeError with BLOOMZ: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

Open karim1104 opened this issue 2 years ago • 1 comments

I'm facing the above error in both stage 1 and stage 2 when using BLOOMZ 3B and 560M. I tried adding "model.to(device)" and "model.to('cuda')" to main.py but neither worked. The error only appears when I switch from Llama to BLOOMZ.

karim1104 avatar Jun 17 '23 04:06 karim1104

hi, whether you have figured out the reason?

XuJing1022 avatar Dec 05 '23 15:12 XuJing1022