lukec comments

Results 21 comments of


                                            lukec

[Feat] Support FlashMLA backend with MTP and FP8 KV cache

Do you have the results of the accuracy test?

[Feat] Support FlashMLA backend with MTP and FP8 KV cache

* fix util error when egale config=314 @quinnrong94 > For Future PRs: > > * Do some profiling and check whether there is any bubble caused by synchronization between CPU...

Support qwen3 deepep

> Do we need raise error for bf16 when enable deepep? I'm not sure, it's necessary? @zhyncs @ch-wan

[Feature] support ep for DeepSeek V3

> expect this feature On the way on the way

Development Roadmap (2025 H2)

> Greate job! If I want to participate in VLM, what can I do? You can contact me on Slack. We have an eagle-vlm team. My slack is Chao Wang....

Development Roadmap (2025 H2)

> Interested in supporting DS V3/R1, who should I reach out to? You can search for specforge in the sgl project of slack.

feat: added low VRAM flash attention backend

How much performance improvement does flex attention offer in comparison?

Added mistral model support

Could you fix the code format @ValeGian

Add Eagle3 training for more MLLM model

Great job!!!!! This enables us to support all MLLM models. @FrankLeeeee

Support Train Eagle-3 By DeepSpeed

Is the training speed improved compared to the original implementation?