DeepSpeedExamples
DeepSpeedExamples copied to clipboard
autotuning, error
hi, thanks for your great job.
I'm trying to start autotuning task, but stuck with some errors.
entry script
cd DeepSpeedExamples/autotuning/hf/gpt2-medium
bash test_tune.sh z0, ok
bash test_tune.sh tune, errors
environment
ngc-23.03 apt-get install nfs-common pdsh
error infos
some questions
- is there any recommended image or dockerfile, to avoid the environment problem?
- how to solve the above errors
@tjruwase Please help me solve the issue, thank you very much.