user-0a
user-0a
+1 for Command R Plus! CohereForAI/c4ai-command-r-plus
Would also like support for this! Thank you for all of the hard work @ncomly-nvidia
Thank you!
Very interested in this!
+1, seeing this error as well
+1, would also like support for this!
This can likely be implemented with the Executor API: https://github.com/NVIDIA/TensorRT-LLM/blob/31ac30e928a2db795799fdcab6be446bfa3a3998/examples/cpp/executor/README.md#L4