
GRPOTrainer cannot find the "pad_token_id" when using the Qwen2.5-VL-72B-Instruct model

Open doggystyle-star opened this issue 9 months ago • 4 comments

I tried to run the Qwen2.5_(3B)-GRPO.ipynb notebook, replacing the model Qwen2.5-3B with Qwen2.5-VL-72B, but it hit this error:

"qwen2vlprocessor object has no attribute 'pad_token_id' ".

But in preprocessor_config.json, "pad_token_id" is set to 151643. How can I fix it? Here are my package versions: huggingface-hub==0.28.1, pillow==10.4.0, transformers==4.49.0, triton==3.1.0, trl==0.15.1, unsloth==2025.2.12, unsloth-zoo==2025.2.5, vllm==0.7.2

Image

Image

doggystyle-star avatar Feb 21 '25 02:02 doggystyle-star

Oh my - will work on a fix!

danielhanchen avatar Feb 22 '25 03:02 danielhanchen

Have you encountered a situation where the merged fine-tuned model can't find the local model? If so, how did you resolve it?

jia-zhen-yu avatar Feb 23 '25 00:02 jia-zhen-yu

> Have you encountered a situation where the merged fine-tuned model can't find the local model? If so, how did you resolve it?

No, I haven't yet. Maybe you should add this before:

    model, tokenizer = FastVisionModel.from_pretrained(
        "{YOUR_MODEL}",
        load_in_4bit = True, # Use 4bit to reduce memory use. False for 16bit LoRA.
        use_gradient_checkpointing = "unsloth", # True or "unsloth" for long context
    )

doggystyle-star avatar Feb 24 '25 02:02 doggystyle-star

Image

Maybe the source of the problem is trl (version 0.15.1). The image above shows my replacement of the pad_token_id for Qwen2.5-VL-72B, but now a new problem occurs:

Image

This also happens with trl==0.15.1.
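For anyone hitting the same attribute error, here is a minimal sketch of the kind of workaround I mean. It is not the fix that unsloth or trl shipped. It assumes the multimodal processor keeps its text tokenizer in a `.tokenizer` attribute, which is the usual layout for Hugging Face processors; `resolve_pad_token_id` is a hypothetical helper name, and the `SimpleNamespace` objects are stand-ins so the snippet runs without loading a 72B model. It only addresses the missing attribute, not the follow-up error shown in the image above.

```python
from types import SimpleNamespace


def resolve_pad_token_id(processing_class):
    """Return pad_token_id from a tokenizer, or from a processor's nested tokenizer.

    Hypothetical helper for illustration; not part of trl or unsloth.
    """
    # A plain tokenizer carries pad_token_id directly.
    pad_id = getattr(processing_class, "pad_token_id", None)
    if pad_id is None:
        # Assumption: a multimodal processor exposes its text tokenizer as `.tokenizer`.
        inner = getattr(processing_class, "tokenizer", None)
        pad_id = getattr(inner, "pad_token_id", None)
    if pad_id is None:
        raise ValueError("no pad_token_id found on the object or its tokenizer")
    return pad_id


# Stand-ins only; real objects would come from transformers / unsloth.
fake_tokenizer = SimpleNamespace(pad_token_id=151643)
fake_processor = SimpleNamespace(tokenizer=fake_tokenizer)

print(resolve_pad_token_id(fake_tokenizer))
print(resolve_pad_token_id(fake_processor))
```

With real objects, the same lookup suggests a simpler thing to try: pass the processor's inner tokenizer (rather than the processor itself) wherever the trainer expects something with a `pad_token_id`. Whether that is enough for a vision model in trl 0.15.1 is exactly what I could not confirm.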

doggystyle-star avatar Feb 24 '25 03:02 doggystyle-star

@doggystyle-star we fixed some issues for Qwen2.5-VL. Let us know if you're still having the issue. You can reopen!

shimmyshimmer avatar Mar 27 '25 08:03 shimmyshimmer