Yu-won Lee comments

Results 230 comments of


                                            Yu-won Lee

BUG of finetune_cls. ValueError: Some specified arguments are not used by the HfArgumentParser

Sorry, I made a quick fix for that. It should be at the data_args, but I've made a mistake when refactoring the code.

DAPO and GSPO plan

I'm currently bit busy so, I'm not sure for it. But I think GSPO is supported by trl in default so, you could just add one or 2 line for...

DAPO and GSPO plan

You can add `--importance_sampling_level "sequence_token"` to use GSPO. Also the deafult loss for grpo is DAPO so I think you could just leave it.

Qwen 2.5 ommni support request

I'm not really familiar with the audio type of thing. Need some study for it.

variable lenght images

You could see this and make your own dataset. https://github.com/QwenLM/Qwen2.5-VL/blob/main/cookbooks/document_parsing.ipynb It's a bit difference with others when you are using grounding jobs.

Yes, that would be more easy. You could remove this part for sure to pass the original image size. https://github.com/2U1/Qwen2-VL-Finetune/blob/04305d230217a4facd2c54aade95c8a9e4765f4c/src/dataset/data_utils.py#L79-L80

variable lenght images

Yes. Grouding task is a bit complicated comparing to other tasks. Maybe I'll make a pipeline for grounding in near future.

[bug?] Crashed on ValueError: absolute aspect ratio must be smaller than 200, got 227.55555555555554

You can add a try-except in the dataset something like this. ``` def __getitem__(self, i): try: # the original code except ValueError as e: # log & skip; DataLoader will...

GRPO training for caption prediction evaluated using BELU score

You could make a reward function like this ``` from sacrebleu import corpus_bleu from rouge_score import rouge_scorer _bleu = lambda cand, ref: corpus_bleu(cand, [ref]).score / 100.0 _rouge = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)...

GRPO training for caption prediction evaluated using BELU score

1. The format is right. The content betwwen `` and `` is literally the thinking process and answering. You could take a look at other reasoning dataset to make your...