Yu-won Lee comments

Results 230 comments of


                                            Yu-won Lee

Finetuning with Multiple jsons

Okay I'll give it a try. Thanks for the comment.

Could you please add GME-Qwen2-VL finetuning code?

I don't know what exactly is a gme version. Also, what probelm did you encounter?

Can We Speed Up Batch Inference?

Yes, vllm and sglang could be a good one to speed up the inference. Also you could use static_cache and compile for it, but you should use a fixed batch...

Can We Speed Up Batch Inference?

@yangfy2023 You could make an Dataset class similar to training dataset. or You could just make an pipeline for it. It's not so difficult.

how to set batch size in the code instead of shell script?

You could directly adjust in he training_args in the `train.py`.

Issue with enabling merger parameters along with LoRA

Sorry for the inconvinience. It could be a little confused. If the `--vision_lora` is not set to true, then the code automatically adds the keyword `visaul` in the list (`merger`...

QLoRA AttributeError: 'Parameter' object has no attribute 'SCB'

It looks like 8bit has this problem but the answer is not that useful. https://github.com/bitsandbytes-foundation/bitsandbytes/issues/454#issuecomment-1636964951 I'll find some other way for this. Thanks for letting me know.

QLoRA AttributeError: 'Parameter' object has no attribute 'SCB'

It seems like this is occured when I've updated the library versions. It may work when downgrading some of the libraries but the code won't work well. I'll find some...

How to load the videos dataset and the configurations

No it dosen't load all datas at once. The log for qwen-vl-utils would show once in the first step. The memory warning is literally a warning that it cause oom...

How to load the videos dataset and the configurations

Sorry I haven't tried multi-node that I have only one machine. So, I couldn't try to solve the problem. Sorry again for the inconvinience.