Conghui Tan
Hi, thanks for your suggestion. I would be glad if my algorithms could become part of SciPy, but I'm afraid they are not suitable. My methods are...
Do you know there is already a Python library, [lightning](http://contrib.scikit-learn.org/lightning/), which solves the same problems?
I met the same issue. `--disable-cuda-graph` works for me. However, adding this option greatly slows down inference in low-QPS settings.
The timeout error occurs because the inference has already gotten stuck or crashed, so setting a longer timeout doesn't help. I did some debugging and found the exact line of code...
Do you have a plan to fix this issue? We need the batch API in our scenario.

> Oh. Currently do not use batch in dpsk models. We find this problem....
Thanks, FrankLeeeee. I also noticed this issue. But maybe it is better to use a UUID instead of the custom_id as the request id? For example, if two batches are processing...
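To illustrate the suggestion: a minimal sketch of UUID-based request ids, assuming the server currently uses the user-supplied `custom_id` directly (the helper name `make_request_id` is hypothetical, not from any actual codebase):

```python
import uuid


def make_request_id(custom_id: str) -> str:
    """Build a collision-free internal request id.

    Prefixing with a random UUID means two concurrently processed
    batches can both contain the same custom_id without their
    requests colliding inside the server.
    """
    return f"{uuid.uuid4()}-{custom_id}"


# Two requests with the same custom_id get distinct internal ids,
# while the original custom_id is still recoverable from the suffix.
a = make_request_id("req-1")
b = make_request_id("req-1")
```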
I tried to run it on an H20, but I encountered the following error when capturing the CUDA graph on the decoding nodes. Adding `--disable-cuda-graph` fixes it, but the decoding speed...
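For reference, a hedged sketch of the workaround, assuming the server is launched via SGLang's `launch_server` entry point (the `--disable-cuda-graph` flag mentioned above matches its CLI); the model path is a placeholder:

```shell
# Placeholder model path; replace with the model actually being served.
MODEL=deepseek-ai/DeepSeek-V3

# Launch with CUDA graph capture disabled to work around the crash.
# Expect lower decoding throughput, especially at low QPS.
python -m sglang.launch_server --model-path "$MODEL" --disable-cuda-graph
```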