Jiarui Fang(方佳瑞)
@Gy-Lu @ver217 @binmakeswell can you update the PP doc?
> Hello 👋 I am currently doing research on long-context modeling and would like to ask whether yunchang has code that can be adapted to Megatron-LM?

https://github.com/FlagOpen/FlagScale/commit/f98ee1e293bd906cc77f512f7a884b2030c10a12 Many people have already integrated USP into Megatron-LM.
Thanks @neonhuang! Could you submit an MR? Run the code you pasted only when the torch version is < 2.3?
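For reference, a minimal sketch of the version guard I have in mind; the two branch bodies are left as placeholders since the actual fallback is the snippet you pasted:

```python
import torch
from packaging import version

# Only take the pasted fallback path on torch releases older than 2.3.
if version.parse(torch.__version__) < version.parse("2.3"):
    ...  # fallback: the code from the pasted snippet
else:
    ...  # keep the current implementation
```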
The parallel group for USP and FSDP should be the same. You can wrap the USP-applied module with FSDP; see the sketch below.
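A minimal sketch of what I mean, assuming a hypothetical `apply_usp` helper and `build_model` constructor (the real yunchang API may differ); the point is that the same process group is passed to both USP and FSDP:

```python
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group(backend="nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

# One shared group for both sequence parallelism (USP) and sharding (FSDP).
sp_group = dist.new_group(ranks=list(range(dist.get_world_size())))

model = build_model().cuda()                  # hypothetical model constructor
model = apply_usp(model, group=sp_group)      # hypothetical: attach USP to this group
model = FSDP(model, process_group=sp_group)   # wrap the USP-applied module with FSDP
```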
How did you use the vllm async rollout? Could you post a test script?
Hi, does this PR https://github.com/feifeibear/long-context-attention/pull/150 solve the problem?
I pulled the latest main branch and launched with the Triton backend:

```
python -m sglang.launch_server --model-path /demo-huabei2/common-models/DeepSeek-R1-Distill-Qwen-7B --disable-radix-cache --host 127.0.0.1 --port 1235 --tensor-parallel-size 1 --speculative-algo EAGLE --speculative-draft /demo-huabei2/common-models/EAGLE/EAGLE-Qwen2-7B-Instruct --speculative-num-steps 5 --speculative-eagle-topk...
```
Qwen 2.5 and Qwen 2 share the same model architecture.
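One quick way to check this (the checkpoint names below are just examples; any Qwen2 / Qwen2.5 pair should behave the same): both report the same architecture class in their Hugging Face configs.

```python
from transformers import AutoConfig

# Both model families reuse the Qwen2 model class.
cfg_qwen2 = AutoConfig.from_pretrained("Qwen/Qwen2-7B-Instruct")
cfg_qwen25 = AutoConfig.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
print(cfg_qwen2.architectures)   # ['Qwen2ForCausalLM']
print(cfg_qwen25.architectures)  # ['Qwen2ForCausalLM']
```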
Thank you for your insightful analysis. Indeed, we have previously encountered similar memory leak issues, and this time I will attempt to improve functions like `torch.empty`. As a temporary workaround,...
> Thank you for your thoughtful response. Setting `use_sync=True` does indeed temporarily address the memory leak issue; however, it introduces additional latency due to increased synchronization overhead. We hope to...