Yu-won Lee
Could you show me your dataset example and your training script? I think something is wrong with the optimizer or the DeepSpeed config. I need to see them to track down the error.
I haven't tested it, but one thing you could check is upcasting the model to fp32. Also, the code is mainly set up for DeepSpeed, so...
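For example, something like this (a rough sketch assuming a Hugging Face-style model; swap in the model class and path you actually use):
```python
import torch
from transformers import AutoModelForCausalLM

# Load directly in fp32 instead of fp16/bf16.
# "your-model-path" is just a placeholder.
model = AutoModelForCausalLM.from_pretrained(
    "your-model-path",
    torch_dtype=torch.float32,
)

# Or, if the model is already loaded in half precision, upcast it:
model = model.to(torch.float32)
```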
Are you trying to finetune the embedding too?
The code is not set up for multi-label, so you need to tweak `cls_dataset.py` a bit.
@BingfengHan @Bentonmaster For multi-label classification, you should change the label part in `cls_dataset.py`. The dataset format should look like
```
"label": [
    class1: 0
    class2: 1
    class3: 1
]...
```
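Roughly, the label handling could then look like this (just a sketch, not the exact code in `cls_dataset.py`; the class list and field names are placeholders):
```python
import torch

CLASSES = ["class1", "class2", "class3"]  # placeholder class names

def encode_labels(label_dict):
    # Turn {"class1": 0, "class2": 1, "class3": 1} into a multi-hot float tensor.
    return torch.tensor([float(label_dict.get(c, 0)) for c in CLASSES])

# Multi-label training uses BCE over per-class logits instead of cross-entropy.
loss_fn = torch.nn.BCEWithLogitsLoss()
# loss = loss_fn(logits, encode_labels(sample["label"]).unsqueeze(0))
```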
Your dataset has no `` token. That might be the issue.
Are you measuring exact-token match on the answer, or the perplexity? Also, the model could have a problem like catastrophic forgetting.
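For reference, this is the kind of check I mean (a sketch, not the repo's eval code):
```python
import math
import torch

def exact_match(pred_text: str, gold_text: str) -> bool:
    # Exact string/token comparison of the generated answer against the gold answer.
    return pred_text.strip() == gold_text.strip()

@torch.no_grad()
def perplexity(model, input_ids, labels):
    # Perplexity = exp(mean cross-entropy loss) on the gold answer tokens.
    loss = model(input_ids=input_ids, labels=labels).loss
    return math.exp(loss.item())
```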
Did you merge the LoRA weights? Or it could be an issue where the updates to the LoRA weights were insufficient. You could increase the rank and the alpha.
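If you haven't merged, something like this should work (a sketch assuming the adapter was trained with PEFT; the paths are placeholders):
```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model, attach the LoRA adapter, then fold the deltas into the base weights.
base = AutoModelForCausalLM.from_pretrained("base-model-path", torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")
model = model.merge_and_unload()
model.save_pretrained("merged-model")
```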
The fact that accuracy falls while loss → 0 is usually a sign that either 1. the adapter weights are not the ones being evaluated, or 2. the generation-time prompt /...
You should freeze the merger too. https://github.com/2U1/Qwen2-VL-Finetune/blob/d5ebc1a417b1d1bf1da24ba8c20e8406356cdcb2/scripts/finetune_lora_vision.sh#L17 Also, if you were using the unmerged weights, check that you have properly loaded the merger (projection) weights.
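If you want to check or do it in code rather than via the script flag, a rough sketch (assuming the projector parameters have "merger" in their names, like Qwen2-VL's `visual.merger`):
```python
# Freeze every parameter whose name contains "merger" (the vision->LLM projector).
for name, param in model.named_parameters():
    if "merger" in name:
        param.requires_grad = False

# Sanity check: nothing under "merger" should still be trainable.
still_trainable = [n for n, p in model.named_parameters() if "merger" in n and p.requires_grad]
assert not still_trainable, f"Merger params still trainable: {still_trainable}"
```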