feat: reduce TRT engine build time in testing
When builder_config.trt_builder_config.max_num_tactics == -1, TRT takes the heuristic path and the default maximum number of tactics to autotune is 100 for SM90 and 60 for other architectures. This number may have a slight impact on e2e perf, but it has no impact on the functional tests, and the change would save nearly 50% of the test time on the related tests.
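To make the default behavior concrete, here is a minimal sketch of how the effective tactic limit could be resolved from the configured value. The function names and the integer encoding of the SM version are hypothetical and are not taken from this change; only the -1 sentinel and the 100/60 defaults come from the description above.

```python
# Hypothetical helpers illustrating the defaults described above.
# They do not call TensorRT; names and signatures are assumptions.

HEURISTIC_SENTINEL = -1  # value of max_num_tactics that selects the heuristic path


def default_max_num_tactics(sm_version: int) -> int:
    """Return the heuristic default: 100 for SM90, 60 for other architectures."""
    return 100 if sm_version == 90 else 60


def effective_max_num_tactics(configured: int, sm_version: int) -> int:
    """Resolve the tactic limit that autotuning would use.

    A configured value of -1 falls back to the heuristic default;
    any other value is used as given.
    """
    if configured == HEURISTIC_SENTINEL:
        return default_max_num_tactics(sm_version)
    return configured
```

With this sketch, a test setup that passes an explicit smaller value (for example `effective_max_num_tactics(10, 90)`) bypasses the heuristic default, which is the kind of override that shortens engine builds in testing.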
/bot run
PR_Github #261 [ run ] triggered by Bot
PR_Github #261 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #252 completed with status: 'FAILURE'
/bot run
PR_Github #317 [ run ] triggered by Bot
PR_Github #317 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #301 completed with status: 'FAILURE'
/bot run
PR_Github #351 [ run ] triggered by Bot
PR_Github #351 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #322 completed with status: 'FAILURE'
/bot run
PR_Github #374 [ run ] triggered by Bot
PR_Github #374 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #338 completed with status: 'FAILURE'
/bot run
PR_Github #476 [ run ] triggered by Bot
PR_Github #476 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #409 completed with status: 'SUCCESS'
/bot reuse-pipeline
PR_Github #506 [ reuse-pipeline ] triggered by Bot
PR_Github #506 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #476 for commit 2794aad