jiapingW
> [@jiapingW](https://github.com/jiapingW) Let me look into it. You can follow the code here: https://github.com/sgl-project/sglang/pull/10517
> Hi, thanks for your great work on SGLang and SpecForge! > > I am trying to test https://huggingface.co/Rayzl/qwen2.5-vl-7b-eagle3-sgl on Qwen2.5-VL using the reference configs from: [#102](https://github.com/sgl-project/SpecForge/pull/102) , but the...
I tested with sglang==0.5.4. The result is below, which looks OK.

```
Created temporary image directory: .cache/mmstar_specforge
Loaded 100 questions.
100%|██████████| 100/100 [00:44
```
SGLang is designing and implementing spec v2, which will handle this issue.
> > Any progress on this feature? > > In progress. Expected to be completed around next Wednesday. Will first finish the SDPA version. The official nightly build of FlexAttention...
Great, I'll test its memory optimization effect in the next couple of days.
I tested with the following command on 4 x H20 GPUs.

```
# CUDA_VISIBLE_DEVICES=4,5,6,7 torchrun \
#     --nproc_per_node 4 \
#     --standalone \
#     scripts/train_eagle3.py \
#     --target-model-path Qwen/Qwen2.5-7B \
#     ...
```
Great. I used `--sp_ulysses_size 2 --sp_ring_size 4`, which uses 58 GB of VRAM per GPU, less than the original SDPA version.
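For reference, a full launch combining the command quoted earlier in this thread with these sequence-parallel flags might look like the sketch below. This is hypothetical: flag spellings and the script path are taken from the snippets above, and note that the product of the Ulysses and ring degrees usually has to match the sequence-parallel world size, so `2 x 4` would imply 8 ranks rather than the 4 GPUs used in the earlier run.

```shell
# Hypothetical launch sketch; flags follow the snippets quoted in this thread.
# --sp_ulysses_size * --sp_ring_size is typically expected to equal the
# sequence-parallel world size, so 2 x 4 implies 8 ranks here.
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun \
    --nproc_per_node 8 \
    --standalone \
    scripts/train_eagle3.py \
    --target-model-path Qwen/Qwen2.5-7B \
    --sp_ulysses_size 2 \
    --sp_ring_size 4
```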