SaltyEggXman
SaltyEggXman
Yes. The previous version works. This seems to be a new issue in the new 0.7.1 release.
@wjj19950828 Which version of the tensorrtllm folder you used for the replacement? I am using the folder in `23.11-trtllm-python-py` to replace the one in `23.12-trtllm-python-py`. It has the same error....
Yes, tried in 0.9.0, it is supported now. What is the relationship between `--use_fp8_context_fmha` and `--context_fmha enable`. If I use `--use_fp8_context_fmha`, do I still need --context_fmha enable`? In general, is...
Do I need to rebuild the docker image for the main branch? I noticed that there were some files updated under docker folder. From the notes, it seems there are...
I reused the docker image built for 0.5.0 branch and generated a new container with the following command: `REPOSITORY TAG IMAGE ID CREATED SIZE tensorrt_llm/release latest-root 5e43c4749c11 41 hours ago...
Tried with the latest main branch code and rebuilt the docker image. TP size = 4 still gave me the following errors: ``` [12/12/2023-00:21:08] [TRT-LLM] [E] Current weight shape is...
In 0.5 release, the generated file is .pt. Why in this version, it is generated as .npz file?
I have exact the same issue. Any update on this?
It seems you have to run ` make -C docker release_run LOCAL_USER=1` instead of `make -C docker run LOCAL_USER=1`. Otherwise, it will look for the devel version of the image...