Lil2J comments

Results 27 comments of


                                            Lil2J

When will the EAGLE3 support for Qwen2

We have successfully trained the Eagle3 versions of Qwen3-8B and Qwen3-30B-A3B based on the official training code, and have open-sourced them. On a single H200 GPU using the sglang inference...

When will the EAGLE3 support for Qwen2

@garycaokai @luoruijie @chtaihei-ust-hk

Training of Qwen2

We have successfully trained the Eagle3 versions of Qwen3-8B and Qwen3-30B-A3B based on the official training code, and have open-sourced them. On a single H200 GPU using the sglang inference...

Eagle Training Parameters and Hardware Requirements

We have successfully trained the Eagle3 versions of Qwen3-8B and Qwen3-30B-A3B based on the official training code, and have open-sourced them. On a single H200 GPU using the sglang inference...

Using kvcached + sglang + qwen-fp8 directly causes an out-of-bounds error. [bug]

My machine has 8 × B200 GPUs, but I only used one B200.

Using kvcached + sglang + qwen-fp8 directly causes an out-of-bounds error. [bug]

@ivanium

Using kvcached + sglang + qwen-fp8 directly causes an out-of-bounds error. [bug]

Thank you very much for your reply. I’m also looking into the kvcached code and would like to contribute to fixing this bug. From my perspective, this project truly has...

Using kvcached + sglang + qwen-fp8 directly causes an out-of-bounds error. [bug]

> Thanks for digging into the code! We totally agree that quantization is a must. We'd love to collaborate if you are interested in helping with the integration. Please feel...

Eagle-3 for LLAMA4

We have successfully trained the Eagle3 versions of Qwen3-8B and Qwen3-30B-A3B based on the official training code, and have open-sourced them. On a single H200 GPU using the sglang inference...

Eagle-3 for LLAMA4

@tchaton