Tianhao Gao comments

Results: 18 comments of Tianhao Gao

RuntimeError: expected scalar type Half but found Float

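> > > I get the same error when fine-tuning bloomz; alpaca-lora, alpaca-Cot, and BELLE all raise it. Could some kind soul tell me how to fix it?
> > > Case solved: I had been using a P40, and after switching to an A100 it works.
> > Where did you get an A100? Those of us on a tight budget are very envious.
Haha, they all belong to my company; we set up 8 A100s for testing.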

[BUG]: PyTorch single-node multi-GPU problem: ERROR: torch.distributed.elastic.multiprocessing.api:failed

same problem

Finetuning is so slow on bloomz-7b1

Hi, did you hit the error "expected scalar type Half but found Float" after changing the model to bloomz?

use bloom-350m to train reward model in step2

same error

use bloom-350m to train reward model in step2

> same error
I have fixed this problem. Ref: https://github.com/microsoft/DeepSpeedExamples/issues/571

Error when using BLOOMZ for reward model training

> > > @LiinXemmon Hi, this is caused by log(0), which will return `inf`. I think you should add a very small value (like 1e-7) to the difference of the two sentences' rewards, it...
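
For anyone landing here from search, here is a minimal sketch of the log(0) issue described in the quote. It is not the DeepSpeed-Chat code: it assumes the reward model's pairwise loss has the common form -log(sigmoid(chosen - rejected)), and the function names are made up for illustration. The quoted comment is truncated, so where exactly the small value gets added is my reading, not a confirmed fix. Plain Python is used so it runs without a GPU; note that `math.log(0.0)` raises `ValueError`, whereas tensor libraries return `-inf`, which turns the loss into `inf`.

```python
import math


def sigmoid(x: float) -> float:
    # Logistic function written so that exp() never overflows.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)  # underflows to 0.0 for very negative x
    return z / (1.0 + z)


def pairwise_loss_eps(chosen: float, rejected: float, eps: float = 1e-7) -> float:
    # Hypothetical helper: -log(sigmoid(diff) + eps).
    # eps keeps the argument of log strictly positive even when the
    # sigmoid underflows to 0.0.
    return -math.log(sigmoid(chosen - rejected) + eps)


def pairwise_loss_softplus(chosen: float, rejected: float) -> float:
    # Hypothetical alternative: -log(sigmoid(d)) equals softplus(-d),
    # computed here as max(-d, 0) + log1p(exp(-|d|)), which never
    # takes the log of a value near zero.
    d = chosen - rejected
    return max(-d, 0.0) + math.log1p(math.exp(-abs(d)))


if __name__ == "__main__":
    # An extreme reward gap makes sigmoid(diff) underflow to exactly 0.0.
    print(sigmoid(-800.0))
    print(pairwise_loss_eps(-800.0, 0.0))
    print(pairwise_loss_softplus(-800.0, 0.0))
```

The epsilon version caps the loss at about -log(eps), so the gradient signal flattens for badly ranked pairs; the softplus form keeps the exact value. Which one suits the training script is a judgment call.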

During the training of Step 3, the reward score of my language model collapsed to a stable point

> According to the README, "We have found that it is very unstable to use different generation training batch sizes (--per_device_train_batch_size) and PPO training batch sizes (--per_device_mini_batch_size), more than one PPO...

During the training of Step 3, the reward score of my language model collapsed to a stable point

> Hello, did you solve it? My average reward is still not increasing during training.
> > According to the README, "We have found that it is very unstable...