brb-nv
Can you please provide us with a minimal reproducible example and fill in the details requested in the issue template?
No efforts are planned for this right now. @1059692261 @tianlinzx, can you please let us know what kind of networks you're working with that are blocked by this? Will help...
@lfr-0531, do you think you can have a look?
I'm guessing you're using the C++ workflow? It's supported in our PyTorch workflow. Please give it a shot:
- https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/pytorch#supported-models
- https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/_torch/models/modeling_qwen2vl.py#L453
Hi, it looks like you're still following the classical TRT codepath. I don't see a mention of `qwen2_5_vl` in [multimodal_builder.py](https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/tools/multimodal_builder.py#L108), so I don't think it's supported there. Can you please try...
Hi @amukkara @symphonylyh, I'm descoping this MR to contain only the changes for the Phi-4-mini model. Changes for Phi-4-MM will be done in a separate MR (https://github.com/NVIDIA/TensorRT-LLM/pull/3177). I've addressed all your comments (including...