Kaiyu Xie comments

Results 71 comments of


Fix: fuse message not aligned on different processes

/bot reuse-pipeline

chore: upgrade transformers to 4.50.0

/bot run

fix wrong arg in Engine Building Command in docs/source/performance/perf-overview.md

@RuibaiXu @nv-guomingz Please note that the latest `docs/source/performance/perf-overview.md` no longer uses the `trtllm-build` command ([link](https://github.com/NVIDIA/TensorRT-LLM/blob/main/docs/source/performance/perf-overview.md?plain=1)).

> the right doc after change should be --max_input_len 2048 --max_seq_len 4096

@RuibaiXu Just FYI...

fix: Reverse graph size order

/bot reuse-pipeline

support for newer checkpoints

/bot run --stage-list "Build-Docker-Images"

Incorrect Relative Path to constraints.txt in bloom/requirements.txt

I see that this is going to be addressed by https://github.com/NVIDIA/TensorRT-LLM/pull/3003. @Pradeep-18062002 Thanks a lot for reporting the issue and helping to fix it!

Qwen2 VL cannot be converted to checkpoint on TensorRT-LLM

> Hi [@xunuohope1107](https://github.com/xunuohope1107), please add `processor = AutoProcessor.from_pretrained(self.args.hf_model_dir)` in `tensorrt_llm/runtime/multimodal_model_runner.py`.
>
> Thanks.

Hi @sunnyqgg, I see that the issue is still being reported by users, can...

perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench