unsloth icon indicating copy to clipboard operation
unsloth copied to clipboard

[GRPO] Changing QLoRA to LoRA or increasing num_gen does not affect VRAM

Open jackswl opened this issue 9 months ago • 2 comments

As mentioned in title, for GRPO, changing QLoRA to LoRA didn't affect VRAM

When I change num_gen from 4 to 8, it did not affect any VRAM. When I change 8 to 16, it increased the VRAM by only 4GB. Something seems off here.

jackswl avatar Feb 27 '25 04:02 jackswl

We shaved VRAM by a lot, so it's probably correct!

Are you certain on QLoRA / LoRA? load_in_4bit = True / False? That's a weird one - LoRA should use much more VRAM

danielhanchen avatar Mar 06 '25 10:03 danielhanchen

Hi @danielhanchen, yup. I changed from QLoRA to LoRA, the VRAM is the same

In fact, I changed my num_gen to 16, and the VRAM also is almost similar to num_gen of 4. I don't know what's the version (I can't access anytime soon), but it should be around 7 days ago when this Issue is posted

jackswl avatar Mar 06 '25 11:03 jackswl