QIE TANG

4 comments by QIE TANG

Any update? I encountered the same issue. I set two environment variables with the commands export MP=1 and export WORLD_SIZE=1, then started training the actor with "fairscale True" in config.yaml.
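For anyone launching from a Python script instead of a shell, the same two variables can be set before the training code starts. This is only a sketch of the setup described above. It assumes the training code reads MP and WORLD_SIZE from the process environment, which is not confirmed in this thread, and the helper name apply_single_process_env is made up for illustration.

```python
import os

# Single-process settings from the comment above: MP=1 and WORLD_SIZE=1.
# Assumption: the training code reads these from the process environment.
SINGLE_PROCESS_ENV = {"MP": "1", "WORLD_SIZE": "1"}


def apply_single_process_env(env=os.environ):
    """Hypothetical helper: set the variables in `env` and return what is now in effect."""
    for name, value in SINGLE_PROCESS_ENV.items():
        env[name] = value  # environment values must be strings
    return {name: env[name] for name in SINGLE_PROCESS_ENV}


applied = apply_single_process_env()
```

The variables have to be set before the trainer is imported or initialized, since distributed setup code typically reads them once at startup.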

Also, the QLoRA training command in the README uses train.py. Is that a mistake? Should it be train_qlora.py?

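I asked the transformers team about this. They said DeepSpeed currently does not support 4-bit/8-bit training, so for now only DDP works; none of the ZeRO optimization stages should work.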
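A launcher script could encode that restriction as a small guard. The sketch below is hypothetical: choose_parallel_strategy and its arguments are not part of transformers or DeepSpeed, and the rule it applies is only what was reported in this comment, so it may no longer hold for newer releases.

```python
def choose_parallel_strategy(quant_bits=None, requested="deepspeed"):
    """Hypothetical helper: fall back to DDP when training in 4-bit or 8-bit.

    quant_bits: 4 or 8 for quantized training, None for full precision.
    requested:  the strategy the user asked for, e.g. "deepspeed" or "ddp".
    """
    # Per the comment above, DeepSpeed (including ZeRO) was reported as
    # unsupported for 4-bit/8-bit training, so only DDP is allowed there.
    if quant_bits in (4, 8) and requested != "ddp":
        return "ddp"
    return requested


strategy = choose_parallel_strategy(quant_bits=4, requested="deepspeed")
```

Returning the fallback instead of raising keeps a QLoRA run going, at the cost of silently ignoring the requested strategy; raising an error would be the stricter choice.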