Kero Liang comments

Results 13 comments of


                                            Kero Liang

关于cdn加速节点没有本地。。。

感谢建议，这边在着手重构这个主题，pjax等功能会考虑提供支持。 CDN 加速节点，全部挂掉的可能性还是比较低的，如果遇到某个 CDN 不可用的情况，可以自行在主题设置先切换。不过，的确只依赖 CDN 的话，在本地开发时可能会有一些不便。如果目前有需要的话，可以把依赖的文件传到主题目录，然后暂时更改一下 CDN 的指向。

""[Here’s Ways To Watch]* UFC 314 ＬＩＶＥ 𝐒𝐭𝐫𝐞𝐚𝐦𝐬 𝙵𝚛𝚎𝚎 𝖮𝗇𝗅𝗂𝗇𝖾 ＯＮ Official ＴｖＣｈａｎｎｅｌ 12 April 2025

> Nice work! I wonder whether it would've been easier to land this contribution in two separate PRs, one with the CUDA implementation and a follow-up with the numba one...

[Misc] Qwen2.5 VL support LoRA

> @imkero @wulipc do you have any LoRA-tuned models that can be used? cc @ywang96 Sorry I don't have one currently

[Perf] Optimize MRotaryEmbedding::get_input_positions performance by numba

> @imkero Thanks for the PR! This is amazing 🚀 > Could you please resolve the merge conflict and the lint error? Sure! I will update this PR soon. TODO:...

[Perf] Optimize MRotaryEmbedding::get_input_positions performance by numba

> @imkero Just so you know: To fix the CI failure, we should move `numba` from `requirements/cuda.txt` and `requirements/rocm.txt` to `requirements/common.txt`. Thanks for your remind, I have moved it to...

[Perf] Optimize MRotaryEmbedding::get_input_positions performance by numba

@WoosukKwon I think this PR is ready for review now

[Perf] Optimize MRotaryEmbedding::get_input_positions performance by numba

I will take get_next_positions_tensor into consideration because they are reported to be time-costing as well in #17617

[Perf] Optimize MRotaryEmbedding::get_input_positions performance by numba

I have written an optimized version of `get_next_input_positions_tensor` which is zero copy and no lru cache. Will add e2e benchmarking result like #17617