TensorRT-LLM icon indicating copy to clipboard operation
TensorRT-LLM copied to clipboard

[fix] Eagle-2 LLMAPI pybind argument fix.

Open jhaotingc opened this issue 7 months ago • 49 comments

Description

A bug is triggered when launching Eagle-2 with LLMAPI (trtllm-serve, trtllm-bench).
Eagle-2 expects 2 arguments, --use_dynamic_tree and --dynamic_tree_max_topK.
Eagle-1 expects --eagle_choices.
Without the fix, pybind expects --eagle_choices to be not None, while the check here asserts --eagle_choice to be none when --use_dynamic_tree=True (Eagle-2 is used).

This is introduced by TRTLLM-2867, MR 8312.

Test Coverage

  • examples/llm-api/llm_eagle2_decoding.py
  • tests/integration/defs/test_e2e.py
    • test_llmapi_example_eagle2_decoding
  • tests/unittest/bindings/test_executor_bindings.py
    • test_eagle_config_pickle
  • tests/unittest/llmapi/test_llm.py
    • test_llm_api_eagle2

GitHub Bot Help

/bot [-h] ['run', 'kill', 'skip', 'reuse-pipeline'] ...

Provide a user friendly way for developers to interact with a Jenkins server.

Run /bot [-h|--help] to print this help message.

See details below for each supported subcommand.

run [--disable-fail-fast --skip-test --stage-list "A10-1, xxx" --gpu-type "A30, H100_PCIe" --add-multi-gpu-test --only-multi-gpu-test --disable-multi-gpu-test --post-merge --extra-stage "H100_PCIe-[Post-Merge]-1, xxx"]

Launch build/test pipelines. All previously running jobs will be killed.

--disable-fail-fast (OPTIONAL) : Disable fail fast on build/tests/infra failures.

--skip-test (OPTIONAL) : Skip all test stages, but still run build stages, package stages and sanity check stages. Note: Does NOT update GitHub check status.

--stage-list "A10-1, xxx" (OPTIONAL) : Only run the specified test stages. Examples: "A10-1, xxx". Note: Does NOT update GitHub check status.

--gpu-type "A30, H100_PCIe" (OPTIONAL) : Only run the test stages on the specified GPU types. Examples: "A30, H100_PCIe". Note: Does NOT update GitHub check status.

--only-multi-gpu-test (OPTIONAL) : Only run the multi-GPU tests. Note: Does NOT update GitHub check status.

--disable-multi-gpu-test (OPTIONAL) : Disable the multi-GPU tests. Note: Does NOT update GitHub check status.

--add-multi-gpu-test (OPTIONAL) : Force run the multi-GPU tests. Will also run L0 pre-merge pipeline.

--post-merge (OPTIONAL) : Run the L0 post-merge pipeline instead of the ordinary L0 pre-merge pipeline.

--extra-stage "H100_PCIe-[Post-Merge]-1, xxx" (OPTIONAL) : Run the ordinary L0 pre-merge pipeline and specified test stages. Examples: --extra-stage "H100_PCIe-[Post-Merge]-1, xxx".

kill

kill

Kill all running builds associated with pull request.

skip

skip --comment COMMENT

Skip testing for latest commit on pull request. --comment "Reason for skipping build/test" is required. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

reuse-pipeline

reuse-pipeline

Reuse a previous pipeline to validate current commit. This action will also kill all currently running builds associated with the pull request. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

jhaotingc avatar Apr 29 '25 20:04 jhaotingc

/bot run --disable-fail-fast

jhaotingc avatar Apr 29 '25 20:04 jhaotingc

/bot run --disable-fail-fast

jhaotingc avatar Apr 29 '25 20:04 jhaotingc

/bot run

katec846 avatar Apr 29 '25 20:04 katec846

PR_Github #3756 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 29 '25 20:04 tensorrt-cicd

/bot kill

katec846 avatar Apr 29 '25 20:04 katec846

PR_Github #3761 [ kill ] triggered by Bot

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

PR_Github #3756 [ run ] completed with state ABORTED

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

PR_Github #3761 [ kill ] completed with state SUCCESS Successfully killed previous jobs for commit a3ec04e

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

/bot run --disable-fail-fast

katec846 avatar Apr 29 '25 21:04 katec846

PR_Github #3763 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

/bot kill

katec846 avatar Apr 29 '25 21:04 katec846

PR_Github #3764 [ kill ] triggered by Bot

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

PR_Github #3763 [ run ] completed with state ABORTED

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

PR_Github #3764 [ kill ] completed with state SUCCESS Successfully killed previous jobs for commit 3219e2e

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

/bot run --disable-fail-fast

katec846 avatar Apr 29 '25 21:04 katec846

PR_Github #3765 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 29 '25 21:04 tensorrt-cicd

LGTM, Thanks for the fix!

yweng0828 avatar Apr 30 '25 03:04 yweng0828

PR_Github #3765 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #2666 completed with status: 'FAILURE'

tensorrt-cicd avatar Apr 30 '25 06:04 tensorrt-cicd

/bot run --disable-fail-fast

jhaotingc avatar Apr 30 '25 07:04 jhaotingc

/bot run --disable-fail-fast

katec846 avatar Apr 30 '25 20:04 katec846

PR_Github #3888 [ run ] triggered by Bot

tensorrt-cicd avatar Apr 30 '25 20:04 tensorrt-cicd

PR_Github #3888 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #2758 completed with status: 'FAILURE'

tensorrt-cicd avatar May 01 '25 01:05 tensorrt-cicd

/bot run --disable-fail-fast

jhaotingc avatar May 02 '25 02:05 jhaotingc

/bot run --disable-fail-fast

katec846 avatar May 02 '25 06:05 katec846

PR_Github #3961 [ run ] triggered by Bot

tensorrt-cicd avatar May 02 '25 06:05 tensorrt-cicd

PR_Github #3961 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #2807 completed with status: 'SUCCESS'

tensorrt-cicd avatar May 02 '25 10:05 tensorrt-cicd

/bot reuse-pipeline

symphonylyh avatar May 02 '25 18:05 symphonylyh

PR_Github #3984 [ reuse-pipeline ] triggered by Bot

tensorrt-cicd avatar May 02 '25 18:05 tensorrt-cicd

PR_Github #3984 [ reuse-pipeline ] completed with state SUCCESS Reusing PR_Github #3961 for commit 7781f44

tensorrt-cicd avatar May 02 '25 18:05 tensorrt-cicd

/bot run --disable-fail-fast

jhaotingc avatar May 05 '25 16:05 jhaotingc