DeepSpeedExamples icon indicating copy to clipboard operation
DeepSpeedExamples copied to clipboard

RuntimeError: Step 1 exited with non-zero status 1

Open yudonglee opened this issue 1 year ago • 13 comments

After finishing install successfully, i got this error when ran this command: python train.py --actor-model facebook/opt-1.3b --reward-model facebook/opt-350m --num-gpus 1

---=== Running Step 1 ===--- Traceback (most recent call last): File "/data/DeepSpeedExamples/applications/DeepSpeed-Chat/train.py", line 218, in main(args) File "/data/DeepSpeedExamples/applications/DeepSpeed-Chat/train.py", line 203, in main launch_cmd(cmd, step_num) File "/data/DeepSpeedExamples/applications/DeepSpeed-Chat/train.py", line 192, in launch_cmd raise RuntimeError( RuntimeError: Step 1 exited with non-zero status 1

how to fix it please ?

yudonglee avatar Apr 13 '23 05:04 yudonglee