Jee Jee Li
> I'm able to repro the failure in `test_llama.py` locally (it passes on the `main` branch) but not `test_minicpmv.py`.

This PR should not affect Llama. I will also verify and...
@DarkLight1337 Which GPU do you use locally? I can successfully run the Llama and MiniCPM-V 2.5 LoRA tests locally on an A800 GPU.
I also encountered a similar problem with a failed Llama test about a month ago. See: https://buildkite.com/vllm/ci-aws/builds/6358#01912bcf-2a2f-4655-9f58-c3f5ae8ea68a
> @DarkLight1337 Which GPU do you use locally? I can successfully run the Llama and MiniCPM-V 2.5 LoRA tests locally on an A800 GPU.

@DarkLight1337 I had issues with my previous...
> Can you successfully run the TP tests locally?

I have tested `test_minicpmv_tp.py` locally, and it passed successfully.
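For reference, here is a minimal sketch of how the TP test can be invoked locally through pytest's Python API; the `tests/lora/test_minicpmv_tp.py` path is an assumption based on vLLM's test layout and may need adjusting:

```python
# Sketch: run the MiniCPM-V TP LoRA test locally via pytest.
# The test path below is assumed from vLLM's test layout and may differ.
import sys

import pytest

if __name__ == "__main__":
    # -v prints per-test results, -s surfaces stdout from the test process.
    sys.exit(pytest.main(["-v", "-s", "tests/lora/test_minicpmv_tp.py"]))
```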
cc @njhill
> I was able to work around this bug by building the latest Triton code from source.

It seems that Triton main has already updated the related code; I don't...
> So I've been digging into this a bit more and here is a summary of my findings:
>
> * Triton recently released v3.0.0, but it does **not** seem...
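As a quick sanity check (just a sketch, not part of this PR), one can confirm which Triton build is actually active in the environment before re-running the failing test, since the behavior apparently differs between the released v3.0.0 and a source build of main:

```python
# Sketch: confirm which Triton build is installed in the current environment.
# A source build of Triton main typically reports a dev/post version string
# rather than the plain "3.0.0" of the PyPI release.
import triton

print("Triton version:", triton.__version__)
```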
> Is there a HF repo that can be used to test this?

Not yet. If needed, I can train one and add tests.
> > > @imkero @wulipc do you have any LoRA-tuned models that can be used? cc @ywang96
> > >
> > > Sorry I don't have one currently
>
> ...