Li Xiaozhe

Results 10 comments of Li Xiaozhe

Same question! I also find that on finetine_lora.sh script provided by haotianliu learning rate is 2e-4, so we should change it to 2e-5?

Same question! Which one to report? There is no 'key | cand/anchor | anchor | cand 'in my output.

I change learning rate from 2e-4 to 2e-5. It is really work!

Using lora finetuing on llava_mix_665k?

@haotian-liu My training hyperparameters remain consistent with you provided. Here are my partial train logs and MME results: 100%|██████████| 10396/10396 [22:39:17

Some question! Could you tell me how to train motionlora?

补充:在Qwen2.5VL上面正常,感觉这有可能是Qwen3-VL的image.processor的问题?它处理image后为空,导致它走了text only的分支

补充:使用vllm+megatron运行成功,但是sglang+fsdp仍然报错