fix: Fix an error related to dummy request when MTP is used
The error is fixed by setting max_num_draft_tokens when creating dummy requests.
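For reviewers, here is a minimal sketch of the idea, not the actual change. It assumes that, with MTP enabled, each request consumes one target token plus its draft tokens per step, so a dummy request created without a draft-token count would be sized too small. `DummyRequest`, `create_dummy_requests`, and `tokens_per_step` are hypothetical names used only for illustration; only `max_num_draft_tokens` comes from this PR.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DummyRequest:
    """Hypothetical stand-in for a dummy (warm-up/padding) request."""
    request_id: int
    input_tokens: List[int]
    # Assumption: 0 means speculative decoding (e.g. MTP) is not in use.
    max_num_draft_tokens: int = 0


def create_dummy_requests(count: int, token_num: int,
                          max_num_draft_tokens: int = 0) -> List[DummyRequest]:
    """Build `count` dummy requests and pass the draft-token count through.

    Leaving max_num_draft_tokens at its default mimics the situation the
    PR describes: dummy requests that do not know about draft tokens.
    """
    return [
        DummyRequest(
            request_id=i,
            input_tokens=[1] * token_num,  # placeholder token ids
            max_num_draft_tokens=max_num_draft_tokens,
        )
        for i in range(count)
    ]


def tokens_per_step(request: DummyRequest) -> int:
    """One target token plus all draft tokens handled in the same step."""
    return 1 + request.max_num_draft_tokens
```

In this sketch, a caller running with three draft tokens would use `create_dummy_requests(2, 4, max_num_draft_tokens=3)`, so that each dummy request accounts for four tokens per step instead of one.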
/bot run --add-multi-gpu-test
PR_Github #687 [ run ] triggered by Bot
PR_Github #687 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #576 completed with status: 'FAILURE'
/bot run --add-multi-gpu-test
PR_Github #695 [ run ] triggered by Bot
PR_Github #695 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #582 completed with status: 'FAILURE'
/bot run
PR_Github #698 [ run ] triggered by Bot
/bot kill
PR_Github #700 [ kill ] triggered by Bot
PR_Github #698 [ run ] completed with state ABORTED
PR_Github #700 [ kill ] completed with state SUCCESS
Successfully killed previous jobs for commit cee8ad3
/bot run --add-multi-gpu-test --disable-fail-fast
PR_Github #718 [ run ] triggered by Bot
PR_Github #718 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #599 completed with status: 'FAILURE'
/bot run --add-multi-gpu-test --disable-fail-fast
/bot run --add-multi-gpu-test --disable-fail-fast
PR_Github #811 [ run ] triggered by Bot
PR_Github #811 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #657 completed with status: 'FAILURE'
/bot run --add-multi-gpu-test --disable-fail-fast
PR_Github #848 [ run ] triggered by Bot
/bot kill
/bot run --add-multi-gpu-test --disable-fail-fast
PR_Github #877 [ kill ] triggered by Bot
PR_Github #848 [ run ] completed with state ABORTED
PR_Github #877 [ kill ] completed with state SUCCESS
Successfully killed previous jobs for commit 0d9cb53
PR_Github #878 [ run ] triggered by Bot
disaggregated/test_disaggregated.py::test_disaggregated_deepseek_v3_lite_fp8_attention_dp_one_mtp[DeepSeek-V3-Lite-fp8] SKIP (https://nvbugs/5155144)
has been waived in the branch. Please re-enable it and run CI.
/bot run --add-multi-gpu-test --disable-fail-fast
PR_Github #906 [ run ] triggered by Bot