南栖

Results 53 comments of 南栖

> Hey @Minami-su, on which version of transformers are you? It reminds me of an older issue #30082 very similar to this which should have been fixed by #30085 (>=...

The lr shown is not changing,but the actual training lr is changing when I set lr to 1e-5 and 1e-2 ``` lr = 1e-5 {'loss': 1.7991, 'grad_norm': 0.0, 'learning_rate': 0.001,...

@skepsun @Mrkkew @slin000111 I'm releasing an open-source framework By combining GRPO + QLoRA + DeepSpeed ZeRO-3,https://github.com/Minami-su/deepspeed-grpo-qlora-vllm