Prashanth Nandavanam
Prashanth Nandavanam
I've run into this myself. I'm attempting to deploy LLama3 and Gemma, and keep running into these issues when generating the engines. Will there be an update/fix any time soon?...
@geraldstanje - I JUST got gemma to deploy (yet to test), after much trial and error. Getting to the correct version number combination for Triton, TensorRT, TensorRT-LLM, and tensorrt_llm_backend involved...
Thanks, @CarterYancey - good to know I wasn't the only one suffering. I did make sure the versions were the same. As you pointed out, the documentation is not accurate,...