Chuang Zhu comments

Results: 20 comments of Chuang Zhu

chore: Ucx ip port remove mpi depend

/bot run --add-multi-gpu-test

chore: Ucx ip port remove mpi depend

/bot run --add-multi-gpu-test

chore: Ucx ip port remove mpi depend

/bot run --add-multi-gpu-test

chore: Ucx ip port remove mpi depend

/bot run --add-multi-gpu-test

"Trying to remove block n by 0 that is not in hash map" spam in release 0.17

This warning can be ignored; it will be fixed in the next release.

[https://nvbugs/5685143][fix] avoid cudaFree overlap with cuda graph

/bot run --add-multi-gpu-test

[https://nvbugs/5685143][fix] avoid cudaFree overlap with cuda graph

Same as https://github.com/NVIDIA/TensorRT-LLM/pull/8903

feat: Adding UCX support for cacheTransceiver

/bot run --add-multi-gpu-test

feat: add chunked context/prefill runtime option to trtllm-serve

Whether to expose more configuration options for trtllm-serve is still under discussion. We may want to handle the various configurations in a similar way to trtllm-bench.

fix: Fix an error related to dummy request when MTP is used

disaggregated/test_disaggregated.py::test_disaggregated_deepseek_v3_lite_fp8_attention_dp_one_mtp[DeepSeek-V3-Lite-fp8] SKIP (https://nvbugs/5155144) has been waived in the branch. Please enable it and run CI.