Yu-won Lee

Results 230 comments of Yu-won Lee

From the latest prints: * **Before PEFT:** `get_input_embeddings().weight.shape == torch.Size([0])` and same for output — so the **base checkpoint already has empty embeddings**. * After loading the adapter, shapes remain...

The code was for `transformers==4.51.3` and I haven't tested with other versions. I'll make an update for it soon.

I'm not really familiar with GRPO, so I'll check about this one. Sorry for the issue.

1. I will found out what is the reason for the performance. 2. I've finetuned the model but it worked for me before, I will test this againg too. 3....

Thanks for letting me know. When I was making this repo 2.5.8 was the stable one for me. If this works then I'll update the code and the env. Thanks!

It looks like it's okay for using the latest flash-attn.

I think something is mismatched. Can you give a sample for your dataset that I could check for this one?

https://github.com/2U1/Qwen2-VL-Finetune/issues/152#issuecomment-2999215309 This may be the issue.

Yes you could just erase the "image".