朱弘泽
朱弘泽
我使用八卡节点的4卡镜像时也会遇到类似的错误。使用八卡镜像正常。
> > 我使用八卡节点的4卡镜像时也会遇到类似的错误。使用八卡镜像正常。 > > 没有解决,一直没有跑起来 确实很奇怪,我用八卡镜像指定export CUDA_VISIBLE_DEVICES=7,2,4 也没有问题,但一但使用4卡镜像就会报错。
(WorkerDict pid=324403) You are attempting to use Flash Attention 2.0 without specifying a torch dtype. This might lead to unexpected behaviour Loading checkpoint shards: 0%| | 0/2 [00:00
> > (WorkerDict pid=324403) You are attempting to use Flash Attention 2.0 without specifying a torch dtype. This might lead to unexpected behaviour Loading checkpoint shards: 0%| | 0/2 [00:00
Please. We need this good job!