Yu-won Lee comments

Results 230 comments of


                                            Yu-won Lee

Countless environment bugs when running finetune.sh

@JasonLeeUT @7998857 I'll upload my docker image in the dockerhub. I was using cuda 12.4 (via docker) and updated the flash-attn into the lastest one right now. Maybe there are...

Countless environment bugs when running finetune.sh

I've uploaded a pre-build image to simplfying the setup.

Countless environment bugs when running finetune.sh

I've uploaded the docker image and how to use it in the readme. Also mentioned about the system env. Thanks for debugging!

CUDA illegal memory access error

Sorry, I haven't seen the error. I'll check what is wrong with it. Thanks for letting me know.

ValueError: Image features and image tokens do not match: tokens: 300, features 298

``` [ { "id": "000000033471", "image": "000000033471.jpg", "conversations": [ { "from": "human", "value": "\nWhat are the colors of the bus in the image?" }, { "from": "gpt", "value": "The bus...

Finetuning script for Gemma3

@Gopi-Uppari Thanks for the interest in the project! Your expirience would be very valuable for developing the project. Feedbacks and issues are always welcome!

Validation Set During Training

Yes you could, but you need to fix a the code a little bit, in `data.py` and `parametrs.py`.

Phi-3-Vision Batch Inference Prompt format

The model does support batch however, the processor dosen't so you should make a function for handling bathc inference. For example it is my code when using litserve. ``` def...

Finetuning feature added for setting `vision_lr` and `resampler_lr`

Can you make this PR merged? This would be helpful for pepole who wants to finetune the model, following some other papers like LLaVA-Next. By my experience, setting the learning...

Why Is Special Handling Required for LoRA with ZeRO Stage 3?

yes it's becuase of lora layers are only applied to LLMs and the non-lora layers should be saved too (such as vision encoder).