ColossalAI icon indicating copy to clipboard operation
ColossalAI copied to clipboard

[BUG]: train_rm.py get lower acc!

Open Yutongamber opened this issue 1 year ago • 3 comments

🐛 Describe the bug

hello, here is a bug, similar with issues-3534

use default Anthropic/hh-rlhf dataset, pretrain_model: bigscience/bloom-1b1 batch_size: 1 max_epochs: 1 max_len: 512 loss_fn: log_sig

loss is random change acc is much lower than than the one reported in the readme.

using the following cmd: image

here is the results: image

Is there any advice?

Thanks!

Environment

No response

Yutongamber avatar Apr 13 '23 09:04 Yutongamber

Has it been trained for one full epoch?

JThh avatar Apr 14 '23 03:04 JThh

Yes, it is trained for one full epoch

Yutongamber avatar Apr 17 '23 01:04 Yutongamber

Hi @Yutongamber Maybe it’s an inappropriate sh command. We have fixed it. Thanks. #3490 https://github.com/hpcaitech/ColossalAI/blob/main/applications/Chat/examples/train_rm.sh

binmakeswell avatar Apr 17 '23 08:04 binmakeswell