DeepSpeedExamples
DeepSpeedExamples copied to clipboard
autotuning, error
hi, thanks for your great job.
I'm trying to start autotuning task, but stuck with some errors.
entry script
cd DeepSpeedExamples/autotuning/hf/gpt2-medium
bash test_tune.sh z0
, ok
bash test_tune.sh tune
, errors
environment
ngc-23.03 apt-get install nfs-common pdsh
error infos
![image](https://user-images.githubusercontent.com/45474996/201101056-b6466994-187a-4cdf-875d-1a0703649d76.png)
some questions
- is there any recommended image or dockerfile, to avoid the environment problem?
- how to solve the above errors