TensorRT-LLM icon indicating copy to clipboard operation
TensorRT-LLM copied to clipboard

fix: Fix an error related to dummy request when MTP is used

Open jinyangyuan-nvidia opened this issue 9 months ago • 3 comments

The error is fixed by setting max_num_draft_tokens when creating dummy requests.

jinyangyuan-nvidia avatar Mar 29 '25 08:03 jinyangyuan-nvidia

/bot run --add-multi-gpu-test

jinyangyuan-nvidia avatar Mar 29 '25 08:03 jinyangyuan-nvidia

PR_Github #687 [ run ] triggered by Bot

tensorrt-cicd avatar Mar 29 '25 08:03 tensorrt-cicd

PR_Github #687 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #576 completed with status: 'FAILURE'

tensorrt-cicd avatar Mar 29 '25 15:03 tensorrt-cicd

/bot run --add-multi-gpu-test

jinyangyuan-nvidia avatar Mar 30 '25 03:03 jinyangyuan-nvidia

PR_Github #695 [ run ] triggered by Bot

tensorrt-cicd avatar Mar 30 '25 03:03 tensorrt-cicd

PR_Github #695 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #582 completed with status: 'FAILURE'

tensorrt-cicd avatar Mar 30 '25 09:03 tensorrt-cicd

/bot run

jinyangyuan-nvidia avatar Mar 30 '25 09:03 jinyangyuan-nvidia

PR_Github #698 [ run ] triggered by Bot

tensorrt-cicd avatar Mar 30 '25 10:03 tensorrt-cicd

/bot kill

jinyangyuan-nvidia avatar Mar 30 '25 10:03 jinyangyuan-nvidia

PR_Github #700 [ kill ] triggered by Bot

tensorrt-cicd avatar Mar 30 '25 10:03 tensorrt-cicd

PR_Github #698 [ run ] completed with state ABORTED

tensorrt-cicd avatar Mar 30 '25 10:03 tensorrt-cicd

PR_Github #700 [ kill ] completed with state SUCCESS Successfully killed previous jobs for commit cee8ad3

tensorrt-cicd avatar Mar 30 '25 10:03 tensorrt-cicd

/bot run --add-multi-gpu-test --disable-fail-fast

Shixiaowei02 avatar Mar 31 '25 02:03 Shixiaowei02

PR_Github #718 [ run ] triggered by Bot

tensorrt-cicd avatar Mar 31 '25 02:03 tensorrt-cicd

PR_Github #718 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #599 completed with status: 'FAILURE'

tensorrt-cicd avatar Mar 31 '25 04:03 tensorrt-cicd

/bot run --add-multi-gpu-test --disable-fail-fast

jinyangyuan-nvidia avatar Mar 31 '25 15:03 jinyangyuan-nvidia

/bot run --add-multi-gpu-test --disable-fail-fast

jinyangyuan-nvidia avatar Apr 01 '25 00:04 jinyangyuan-nvidia

PR_Github #811 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 01 '25 00:04 tensorrt-cicd

PR_Github #811 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #657 completed with status: 'FAILURE'

tensorrt-cicd avatar Apr 01 '25 03:04 tensorrt-cicd

/bot run --add-multi-gpu-test --disable-fail-fast

jinyangyuan-nvidia avatar Apr 01 '25 05:04 jinyangyuan-nvidia

PR_Github #848 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 01 '25 05:04 tensorrt-cicd

/bot kill

jinyangyuan-nvidia avatar Apr 01 '25 08:04 jinyangyuan-nvidia

/bot run --add-multi-gpu-test --disable-fail-fast

jinyangyuan-nvidia avatar Apr 01 '25 08:04 jinyangyuan-nvidia

PR_Github #877 [ kill ] triggered by Bot

tensorrt-cicd avatar Apr 01 '25 08:04 tensorrt-cicd

PR_Github #848 [ run ] completed with state ABORTED

tensorrt-cicd avatar Apr 01 '25 08:04 tensorrt-cicd

PR_Github #877 [ kill ] completed with state SUCCESS Successfully killed previous jobs for commit 0d9cb53

tensorrt-cicd avatar Apr 01 '25 08:04 tensorrt-cicd

PR_Github #878 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 01 '25 08:04 tensorrt-cicd

disaggregated/test_disaggregated.py::test_disaggregated_deepseek_v3_lite_fp8_attention_dp_one_mtp[DeepSeek-V3-Lite-fp8] SKIP (https://nvbugs/5155144)

has been waived in the branch ,please enable it and run ci

chuangz0 avatar Apr 01 '25 13:04 chuangz0

/bot run --add-multi-gpu-test --disable-fail-fast

jinyangyuan-nvidia avatar Apr 01 '25 13:04 jinyangyuan-nvidia

PR_Github #906 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 01 '25 13:04 tensorrt-cicd