Chuang Zhu
Chuang Zhu
/bot run --add-multi-gpu-test
/bot run --add-multi-gpu-test
/bot run --add-multi-gpu-test
/bot run --add-multi-gpu-test
This warning can be ignored , will be fixed in next release.
/bot run --add-multi-gpu-test
same as https://github.com/NVIDIA/TensorRT-LLM/pull/8903
/bot run --add-multi-gpu-test
Whether or not to expose more configurations for trtllm-serve is still under discussion, we may want to do various configurations in a similar way as trtllm-bench.
disaggregated/test_disaggregated.py::test_disaggregated_deepseek_v3_lite_fp8_attention_dp_one_mtp[DeepSeek-V3-Lite-fp8] SKIP (https://nvbugs/5155144) has been waived in the branch ,please enable it and run ci