TensorRT-LLM icon indicating copy to clipboard operation
TensorRT-LLM copied to clipboard

The engine generated by each build has different results for the same input.

Open 1096125073 opened this issue 1 year ago • 4 comments

System Info

trt-llm v0.9.0

Who can help?

@byshiue

Information

  • [X] The official example scripts
  • [ ] My own modified scripts

Tasks

  • [X] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • [ ] My own task or dataset (give details below)

Reproduction

  1. build the engine for test 1
  2. build the engine for test 2
  3. run above 2 engine use the same input

Expected behavior

the ouputs is same

actual behavior

the ouputs is not same

additional notes

I tried to use the model cache when building, but it didn't work.

1096125073 avatar Aug 23 '24 10:08 1096125073

Is there any way to ensure that the engine generated by build is identical?This is important for engineering deployment.

1096125073 avatar Aug 23 '24 10:08 1096125073

Can you provide more details, i.e the cmds, which can help us reproduce this issue?

lfr-0531 avatar Sep 04 '24 04:09 lfr-0531

Can you provide more details, i.e the cmds, which can help us reproduce this issue?

I encountered same issue. I trtllm-build 2 times with everything identical, but the inference results are slightly different between 2 models for the same input. Also I found a similar problem post by others. https://github.com/NVIDIA/TensorRT-LLM/issues/2196

qiancheng99 avatar Sep 06 '24 06:09 qiancheng99

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."

github-actions[bot] avatar Oct 07 '24 02:10 github-actions[bot]

This issue was closed because it has been stalled for 15 days with no activity.

github-actions[bot] avatar Oct 23 '24 02:10 github-actions[bot]